The Journal of Consulting Psychology will
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 

4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 
Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 
To insure that the Brief Report will be no 
longer than one printed page, its typescript,
including all matter except the title and the
author’s lines, must not exceed 80 lines av- 
eraging 42 characters and spaces in length. 
Set the typewriter margins for short lines of 
42 characters, which are 3.5 inches long in 
elite typing, and 4.2 inches long in pica. 
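
The length rule above can be checked mechanically. The sketch below is only an illustration, not part of the journal's procedure: the function names are hypothetical, blank lines are assumed not to count toward the 80-line quota (the typescript is double spaced), and the pitches of 12 characters per inch for elite and 10 for pica are the standard typewriter values that reproduce the 3.5-inch and 4.2-inch figures.

```python
ELITE_CPI = 12  # characters per inch, elite type
PICA_CPI = 10   # characters per inch, pica type


def line_width_inches(chars, chars_per_inch):
    """Width in inches of a typed line of `chars` characters."""
    return chars / chars_per_inch


def check_brief_report(lines, max_lines=80, max_avg_chars=42):
    """Return (ok, line_count, average_length) for a typescript body.

    Assumption: blank lines (double spacing) are ignored, and the title,
    author lines, and footnote have already been removed by the caller.
    """
    body = [ln.rstrip("\n") for ln in lines if ln.strip()]
    count = len(body)
    average = sum(len(ln) for ln in body) / count if count else 0.0
    ok = count <= max_lines and average <= max_avg_chars
    return ok, count, average


if __name__ == "__main__":
    print(line_width_inches(42, ELITE_CPI), line_width_inches(42, PICA_CPI))
    print(check_brief_report(["x" * 42] * 80))
```

Because the rule is stated as an average, individual lines may run past 42 characters as long as the mean over the counted lines does not.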

The manuscript of the Brief Report must 
be double spaced throughout. Except for its
short lines, it follows the standard style of 
the 1957 revision of the APA Publication 
Manual. Headings, tables, and references are 
avoided or, if essential, must be counted in 
the 80 lines. Each Brief Report must be ac- 
companied by a footnote in the style below, 
which is typed on a separate sheet and not 
counted in the 80-line quota: 


1 An extended report of this study may be ob-
tained without charge from John Doe, 300 Market
St., Prospect 6, Mass. (giving the author’s full name
and address), or for a fee from the American Docu-
mentation Institute. Order Document No. ——, re-
mitting $—— for microfilm or $—— for photo-
copies.


Extended report. Because the extended re- 
port is intended for photoduplication, and is 
not copy to be sent to a printer, its style 
should differ in several ways from that of 
other manuscripts: (a) The extended report
should be typed with single spacing for 
economy in duplication. (b) Tables and fig- 
ures should be placed adjacent to the text 
which refers to them. A caption should be 
typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, 
the full report is prepared in the style speci- 
fied by the Publication Manual. 
DEPTH OF INTERPRETATION AND VERBAL
RESISTANCE IN PSYCHOTHERAPY 1


JOSEPH C. SPEISMAN


University of 


Much of the research on psychotherapy has 
been oriented toward identifying the factors 
which contribute to the success or failure of 
various techniques. This study follows a more 
recent trend toward utilization of the process 
of psychotherapy as a source for investigating 
variables of interpersonal behavior (Bordin, 
Cutler, Dittmann, Harway, Raush, & Rigler, 
1954; Collier, 1953; Raush, Sperber, Rigler, 
Williams, Harway, Bordin, Dittmann, & Hays, 
1956). 

The purpose of the study was to observe 
the effects of varying depths of interpretation 
on resistance. There are many disagreements 
in the literature on psychotherapy concern- 
ing the proper application of interpretation, 
but it is unclear whether these are truly theo- 
retical distinctions or problems of definition. 
The most obvious disagreement is exemplified 
in Fenichel’s (1941) advice to interpret just 
beyond the preconscious, and Rogers’ (1951) 
suggestions to reflect and clarify only what 
the patient has already stated. Fenichel and 
Rogers, however, reach agreement in their op- 
position to even judicious use of deep inter- 
pretations advocated by Berg (1947) and oc- 
casionally by others (Klein, 1932; Stekel, 
1911). 

Auld and Murray (1955) have concluded 
that research in the area of the effects of 
therapist interpretations has indicated very 
little more than, “there is no single type
of response by the therapist that works best 


1 Based on a doctoral dissertation submitted to the 
University of Michigan, 1957. The research was made 
possible through the support of U.S.P.H.S. Project 
M-516, Analyses of Therapeutic Interaction, E. S.
Bordin, principal investigator. The author wishes to
thank the members of the project for their coopera-
tion, and to express particular appreciation to Harold
L. Raush for his assistance and guidance.
under all circumstances.” While this is un-
doubtedly true, there remains the problem of
specifying the effects of various therapist re-
sponses under diverse conditions. The pre-
dictions for the present study were derived
largely from Fenichel (1941), but they were
exploratory in intent. Simply stated, the par-
ticular proposition being studied was that
deep interpretations lead to the most resist-
ance, moderate interpretations lead to the
least resistance, and superficial interpretations
fall between the other two levels as to their
influence on resistance.


Method and Procedure 


Collier (1953) has suggested that inter- 
pretive activity in psychotherapy may be 
scaled on a continuum. Harway et al. (1953) 
have developed such a scale for depth of in- 
terpretation (referred to as D/I), and have 
provided information on reliability and va- 
lidity for the instrument (Raush et al., 1956).
As the criterion for depth, the scale empha-
sizes the disparity between the therapist’s
view of and the patient’s awareness of his 
own emotions and motives. Thus the ratings 
are independent of the genetic depth of any 
given interpretation. The seven-point scale 
ranges from superficial interpretations em- 
bodied in restatements of what the patient 
has just said to deep interpretations charac- 
terized as completely beyond the patient’s 
conscious grasp. 

For the purposes of this study the scale was 
divided into three areas which define the three 
levels of interpretation. Superficial interpreta-
tions were either restatements of what the pa-
tient had just said or only slightly modified
repetitions. These interpretations fall between
1 and 2.4 on the scale (see Table 1). Moderate
interpretations presented the material in
a different manner or in a new context (be-
tween 2.5 and 4.4 on the scale). Deep inter-
pretations were those which were considered
to deal with material which was outside the
patient’s awareness (from 4.5 to 7 on the
scale).


Table 1

Depth of Interpretation Scale

Scale descriptions, from superficial to deep:

Restatement of material of which the patient is aware.
Therapist merely repeats the material of which the
patient is fully aware.
Implied focusing with regard to material of which
the patient is aware.
Therapist connects for the patient two aspects of
the content of the previous patient statement.
Reformulation of the behavior of the patient during
the interview in a way not explicitly recognized
previously by patient.
Use of preceding patient statement to exemplify a
process that has been building up during the
interview and of which the patient is seemingly
unaware.
Therapist speculates as to a possible childhood
situation that might relate to current patient
feeling.
Therapist response deals with inferences about
material completely removed from the patient’s
awareness.


Six categories representing verbal aspects of
resistance were derived after a search of the
literature. The categories were intended to de-
scribe the status of communication from pa-
tient to therapist. The rationale was that re-
sistance could be measured both positively
(i.e., pursuing the therapeutic task) and nega-
tively (i.e., avoiding the therapeutic task in
various ways). The therapeutic task in this
case was simply that the patient speak as
freely and completely as is possible. It was
clear that this concept of resistance is lim-
ited in that (a) only verbal manifestations of
resistance were tapped, and (b) only the su-
perficial or what Glover (1955) has called
“obvious” resistances were studied.

The three positive categories were:


Exploration category. Rate the degree to which the
patient is actually exploring either new material or
material which may have been discussed at some
previous time. The patient may be introducing as-
sociations, ideas, thoughts, feelings, perceptions, etc.,
about the new or old material. He is perhaps at-
tempting to verbalize either more extensively or in-
tensively; he is trying to conceptualize, abstract, or
concretize some aspect of whatever content is under
discussion. [...] mention of an idea or feeling would
be rated [...] higher exploration as the patient
investigates [...]

Self-scrutiny category. Rate the degree to which
the patient is examining himself. He may be reacting
[to his own] statements or those of the therapist, but
[in either] case he is investigating reasons for, or
ex[...] reactions to, his own feelings or statements.

[Self-orien]tation category. Rate the degree to which
the patient is reporting his own feelings, emotional
[...], ideas, etc., in whatever topic is being dis-
cussed. This would be contrasted with the patient
[talking] about other people’s reactions, or about
events without mentioning his own feelings
[about, or] reactions to, these events.


The three negative categories were:


Opposition category. Rate the degree to which the
[patient’s] statement contains the qualities of opposi-
tion or denial toward the therapist or therapy. The
opposition need not have a directly expressed object.
You may, for example, rate opposition from a pa-
tient statement which expressed doubt or negativism
[where the] therapist or therapy as an object is only
inferred.

Superficiality category. Rate the degree to which
the patient is being superficial. Try to evaluate to
what extent he is saying only what is surface and
obvious. For example, to what extent is he empha-
sizing details and particulars at the expense of
deeper meaning.

Blocking category. Rate the degree to which the
patient seems to be substituting an acceptable
thought for one he cannot bring himself to express
directly. Important cues would be areas which are
“glossed over,” or instances where the patient ab-
ruptly interrupts himself and continues in a different
vein. Other cues would be pauses, hesitations, stum-
bling, stuttering, repetitions, etc.
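
The three interpretation levels defined earlier in this section (1 to 2.4, 2.5 to 4.4, and 4.5 to 7 on the D/I scale) amount to a simple banding rule, which might be sketched as follows. This is an illustration under stated assumptions rather than the study's own procedure: the function name is hypothetical, the three judges' ratings are assumed to be combined by averaging (the paper says only that ratings were combined), and values falling between the published bands, such as 2.45, are assigned by treating 2.5 and 4.5 as cut points.

```python
def interpretation_level(combined_rating):
    """Map a combined D/I rating on the 1-7 scale to a level name.

    Bands follow the text: 1-2.4 Superficial, 2.5-4.4 Moderate,
    4.5-7 Deep. Using 2.5 and 4.5 as cut points is an assumption
    for values that fall between the published bands.
    """
    if not 1 <= combined_rating <= 7:
        raise ValueError("D/I ratings lie on a seven-point scale (1 to 7)")
    if combined_rating < 2.5:
        return "Superficial"
    if combined_rating < 4.5:
        return "Moderate"
    return "Deep"


def combine_judges(ratings):
    """Average the judges' ratings (assumed combination rule)."""
    return sum(ratings) / len(ratings)


if __name__ == "__main__":
    # Hypothetical ratings from three judges for one therapist response.
    print(interpretation_level(combine_judges([3, 4, 4])))
```
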
Table 2

Horst Reliability Indices for Three Judges
on the Resistance Categories


Reliability and Interrelations of Measures 


Interjudge agreement had been obtained on
the measures during their development using
case material similar to that used in the final
procedure, and was also calculated for the
case materials under consideration. In every
instance interjudge agreement was obtained
by means of the Horst reliability index
(1949).

The reliability of the D/I measure had re-
ceived considerable attention, but as a reli-
ability check for this study three judges rated
two interviews which were not to be included
in the final data. The combined ratings of
three judges yielded Horst indices of .77 and
.81, and since this index, unlike the usual cor-
relational index, is a direct approximation of
the variance accounted for, these reliability
figures were deemed sufficient.


Three different judges using the resistance 
categories independently rated the same two 
interviews as had been used for the D/I reli- 
ability check. 

The results from each resistance category
are given in Table 2. The reliability for all
categories except Blocking was considered
acceptable. Interjudge agreement was also
checked for the ratings of the final case ma-
terial and was of the same order as the fig-
ures reported in Table 2. The Blocking cate-
gory reliability figure remained low (e.g., on
the order of .20) despite further training, and
this category was not included in the final
results.


The several resistance categories were then 
intercorrelated since they had been presumed 
to measure different aspects of resistance. As 
can be seen in Table 3, this assumption was 
unwarranted. 

The categories seemingly represent only
two independent aspects of resistance: Op-
position and either Exploration, Self-scrutiny,
Self-observation, or Superficiality. It remains
an open question as to whether it was the
definitions, the judges, or the conceptualiza-
tion of resistance which did not permit finer
discrimination, but for the purpose of clearer
exposition results will be reported fully only
for the Exploration and Opposition categories.

Procedure


In order to obtain data from as broad a
range of cases as possible and still provide the
detailed and extensive information usually re-
quired to make judgments about psychother-
apy, the data were collected in two ways. One
source was all responses in a sequence of five
consecutive interviews from a single case (In-
tensive case). The second source was 21 dif-
ferent cases from which units of 11 succes-
sive responses were randomly selected (Ex-
tensive cases). All cases were described by


2 The Superficiality category had been prepared as
a negative aspect of resistance, but the scale accom-
panying this category was reversed, and the judges
were rating on some dimension of “meaningfulness”;
thus the high positive correlation with Exploration.
their therapists as neurotic problems although
no attempt was made to select cases by noso- 
logical classification. The schools represented 
by the therapists included nondirective, modi- 
fied psychoanalytic, Adlerian, and eclectic. 
Therapists were psychologists and psychia- 
trists, and ranged from approximately one to 
ten years in experience. 

Judges were advanced graduate students or 
faculty members in clinical psychology at the 
University of Michigan, and all judges had 
100 hours or more of supervised experience 
as psychotherapists. Three judges, none of 
whom had been involved in the original reli- 
ability checks, did all of the ratings on the 
resistance categories. A total of 12 new judges 
rated D/I. Each interview was rated by three 
judges, and ratings were done independently. 
All ratings were done on transcriptions of in-
terviews taken directly from electromagnetic
recording tapes. Typescripts were done by a
secretary trained to follow a uniform pro-
cedure. In addition to actual conversation all
pauses, stutters, slurs, etc. and expressions
such as laughter, sighs, gasps, groans, etc.
were included. All names, places, and other
identifiable materials were coded to insure
confidentiality for the patient. Judges were
given general instructions which cautioned
them not to read ahead of the responses they 
were rating, and not to change a rating once 
they read further in the interview. For each 
interview, judges were given a rating booklet 
with blank seven-point scales, and either the 
D/I definition and scale or the category defi- 
nitions, each of which was accompanied by 
an illustrative seven-point scale with only the 
end points identified. 

Judges who rated D/I rated therapist re- 
sponses in the sequence in which they ap- 
peared in the interview. A response was de- 
fined as a therapist statement occurring be- 
tween two patient statements, and judges 
were requested to rate each response. Since 
rating D/I requires of the judges an estima- 
tion of the patient’s awareness it was neces- 
sary to include the patient statements in the 
material as well as the therapist remarks. 

Judges who rated resistance were presented 
with only the patient statements in their 
proper sequence. Therapist remarks were de- 
leted in order to avoid the possibility that 
the ratings might be influenced by judges’
theoretical biases as to the relationship be-
tween therapist and patient statements (see
Harway et al. [1953], for a discussion of this 
procedure). 

The ratings on the intensive case included 
all of the responses in the interviews, and no 
additional context material was provided. In 
the extensive series the rater was allowed to 
read that portion of the interview which pre- 
ceded the 11 responses to be rated. 


Results 


All analyses were done by a modification
of Kendall’s tau (1947). The modification
allows for ties in ranks, up to the case in
which both variables are dichotomized, and
in complex tables it yields the direction of
association more readily than the more com-
monly used chi square procedures.
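
To make the idea of a tie-corrected tau concrete, the sketch below computes Kendall's tau-b, one standard form of the coefficient that remains defined when both variables are dichotomized. It is offered as an assumption-laden illustration: the paper does not specify which modification was used, so tau-b may differ from the statistic actually reported, and the data in the usage example are invented.

```python
from itertools import combinations
from math import sqrt


def kendall_tau_b(x, y):
    """Kendall's tau-b for paired observations, with a correction for ties.

    tau_b = (C - D) / sqrt((C + D + Tx) * (C + D + Ty)), where C and D
    count concordant and discordant pairs, Tx counts pairs tied only on
    x, and Ty counts pairs tied only on y. Pairs tied on both variables
    contribute nothing.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    concordant = discordant = tied_x_only = tied_y_only = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        dx, dy = x1 - x2, y1 - y2
        if dx == 0 and dy == 0:
            continue
        if dx == 0:
            tied_x_only += 1
        elif dy == 0:
            tied_y_only += 1
        elif dx * dy > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denominator = sqrt((untied + tied_x_only) * (untied + tied_y_only))
    if denominator == 0:
        raise ValueError("tau-b is undefined when a variable is constant")
    return (concordant - discordant) / denominator


if __name__ == "__main__":
    # Invented data: 1 = Deep, 0 = Not Deep; 1 = high, 0 = low Exploration.
    depth = [1, 1, 1, 0, 0, 0, 0, 0]
    exploration = [0, 0, 1, 1, 1, 0, 1, 1]
    print(kendall_tau_b(depth, exploration))
```

The sign of the coefficient gives the direction of association directly, which is the property the text contrasts with chi square.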

Table 4 presents the taus and significance 
levels for comparisons among each of the 
three levels of interpretations as to their ef- 
fects on Exploration and Opposition. In ad- 
dition, the ratings of responses following Deep 
interpretations were compared with a pool of 
the ratings of responses following Moderate 
and Superficial interpretations (Not Deep in 
the table). The data presented in Table 4 in-
dicate that both Superficial and Moderate in-
terpretations were followed by more Explora-
tion and less Opposition (i.e., lower resist-
ance) than were Deep interpretations, and
that Moderate interpretations were followed
by patient statements rated lower on resist-
ance than were Superficial interpretations.
These results support the formulation ad-
vanced earlier that Deep interpretations lead
to the most resistance, Moderate interpreta-
tions lead to the least resistance, and Super-
ficial interpretations fall between the other
two levels as to their influence on resistance.

While all the results were in the predicted
direction and 12 of the 16 comparisons were
statistically significant, the degree of associa-
tion indicated by the taus was not high.
Presumably other factors, including other as-
pects of resistance, operate in the expression
of opposition and exploration.

As another approach to the problem, shifts 


3 All comparisons including those with the resist-
ance categories not reported in this section were in
the predicted direction, and the number of signifi-
cant results was comparable to those reported.
Table 4

Relationships Between Levels of Interpretation and
Degree of Resistance

Comparisons of interpretation levels: Not Deep vs.
Deep; Superficial vs. Deep; Moderate vs. Deep;
Moderate vs. Superficial.
Taus under the Exploration and Intensive headings:
.18**, .19, .24**, .24**


from one interpretation level to another were
compared with the reverse shifts. That is,
shifts from Moderate to Superficial interpreta-
tions were compared with shifts from Super-
ficial to Moderate interpretations, etc. as to
the influence of each shift on the expression
of Exploration and Opposition. This allowed
a more direct approach to the interaction be-
tween the various depths of interpretations
than the earlier analyses which only com-
pared each of the levels of interpretation as
to their general influence within or across
cases.

Table 5 presents the taus and significance
levels for the comparisons of the effects of
shifts in interpretation levels. The data indi-
cated that shifts from Deep to Not Deep
(i.e., combination of Moderate and Super-
ficial) interpretations were followed by in-
creased Exploration and decreased Opposition
(lowered resistance), while higher resistance
followed the reverse shift. Shifts from Mod-
erate to Superficial interpretations were fol-
lowed by increased resistance, and shifts
from Superficial to Moderate interpretations
were followed by decreased resistance.

As had the previous results, these data in-
dicated that Deep interpretations were fol-
lowed by the highest levels of resistance, and
Moderate interpretations were followed by
lower resistance than Superficial interpreta-
tions. For these analyses, however, the rela-
tionship between Superficial and Deep inter-
pretations was indeterminate. Individual com-
parisons of shifts between Deep and Moderate
and Superficial interpretations could not be
made, since the N in these cases became too
small. It should be noted that in contrast to
the previous analyses, the amount of variance
accounted for is substantial.


Validity

An integral part of this study was the de- 
velopment of the resistance measures, and 
thus the question of validity, as well as reli- 
ability and interrelationships, requires some 
comment. In the absence of outside criteria, 
one relevant procedure was a statement of 
extent of support, weight of evidence, or con- 
struct validity. In this sense the two cate- 
gories of resistance were successful measures. 
Exploration provided seven significant results 
in eight comparisons, and Opposition pro- 
vided five out of eight; both beyond chance 
expectation. These results must be interpreted 
with caution because of the circularity in- 
volved in this use of construct validity, and 
because some comparisons involved the same 
data as others. 
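
The phrase "beyond chance expectation" can be illustrated with a binomial tail probability. The sketch below rests on assumptions that go beyond the text: a per-comparison significance level of .05 (the level is not stated in this passage) and independence among the eight comparisons, which the authors themselves caution does not strictly hold because some comparisons share data. The function name is illustrative.

```python
from math import comb


def prob_at_least(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p).

    Here X is the number of comparisons reaching significance by chance
    alone when each has probability p of doing so.
    """
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


if __name__ == "__main__":
    alpha = 0.05  # assumed per-comparison significance level
    print("Exploration, 7 of 8:", prob_at_least(7, 8, alpha))
    print("Opposition, 5 of 8:", prob_at_least(5, 8, alpha))
```

Under these assumptions both tail probabilities are very small, but dependence among comparisons would make the true chance probabilities larger than the binomial figures suggest.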

Another, perhaps indirect, approach to va- 
lidity was to demonstrate the invalidity of 
obvious alternative interpretations of the re- 
sults. One potential criticism of the Explora- 
tion category was that it measured nothing 
more than the amount of verbiage the patient 
produced and that this did not necessarily in-
dicate the relative absence of resistance that
the category was supposed to represent. Some 
relationship was to be expected between num- 
ber of words and Exploration given the gen- 
eral instructions to patients to speak freely, 
but this should account for only a minimum 
of the category’s usefulness. A Pearson r of 
.35 was obtained between number of words 
and Exploration ratings, and this correlation 
was compared with reliability figures of .88 
and .79 for Exploration. As a further check, 
partial correlations were obtained with Ex-
ploration, Self-scrutiny and Superficiality, in
each case holding constant the number of
words. Exploration and Self-scrutiny corre-
lated .72, and the resulting partial correlation
was .66. Exploration and Superficiality corre-
lated .67, and when number of words was
held constant the correlation obtained was
.60. Both the comparison of correlation be-
tween Exploration and number of words, and
the partial correlations were considered to
support the idea that Exploration involves
more than the mere length of the patient’s
statement.
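
The partial correlations described above follow the standard first-order formula, sketched here for illustration. The correlations of .72 and .35 come from the text; the correlation between Self-scrutiny and number of words is not reported, so the value used in the usage example is purely hypothetical and the result should not be read as a reproduction of the published .66.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, holding z constant.

    r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz**2) * (1 - r_yz**2))
    """
    denominator = sqrt((1 - r_xz**2) * (1 - r_yz**2))
    if denominator == 0:
        raise ValueError("undefined when z correlates perfectly with x or y")
    return (r_xy - r_xz * r_yz) / denominator


if __name__ == "__main__":
    r_expl_scrutiny = 0.72   # from the text
    r_expl_words = 0.35      # from the text
    r_scrutiny_words = 0.30  # hypothetical; not reported in the paper
    print(partial_correlation(r_expl_scrutiny, r_expl_words, r_scrutiny_words))
```

When the controlled variable correlates only modestly with both measures, the partial correlation stays close to the original one, which is the pattern the authors rely on.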
Effects of Resistance on Interpretation


It is quite conceivable that the therapist’s 
level of interpretation is influenced by the de- 
gree of resistance he encounters. For exam- 
ple, if a therapist is faced with high resist- 
ance, he may resort to more extreme, and 
thus deeper, interpretations in an effort to re- 
duce the level of resistance. Accordingly, the 
degree of resistance preceding various levels
of interpretation was explored. No significant
relationships were found, although Opposition 
appeared to have more of an influence on 
subsequent interpretation than did Explora- 
tion. The relationship between preceding Ex- 
ploration and following levels of interpreta- 
tion was indicated by taus of .02 and .06 for 
intensive and extensive cases respectively, and 
comparisons between preceding Opposition 
and subsequent interpretation levels yielded 
taus of .13 and .10. 


Discussion 


Within the limits of the study, three gen-
eral conclusions may be stated: (a) Therapist
interpretations systematically influence pa-
tient responses; (b) the resistance level of a
patient remark does not significantly influence
the depth of the immediately following thera-
pist interpretation; and (c) the Moderate
level of interpretation is most effective in
maintaining minimum levels of that aspect
of resistance which varies with interpretative
activity, and when Moderate interpretations
follow Deep or Superficial interpretations
they are more likely to reduce resistance than
are the reverse orders.

Very little work has been done which can
be compared directly with the present investi-
gation. There have been several studies of the
effects of therapist activity on such concepts
as insight or progress, but the results appear
to be contradictory. Dittmann (1952) found
that patient “Progress” was enabled pri-
marily by interpretations which were some-
what beyond the reflective level. Bergman
(1951), on the other hand, concludes that
“Reflection of Feeling” was the only tech-
nique which led to “Insight” or “Continued
Exploration.” Insofar as the definitions and
operations coincide among the various stud-
ies, the results obtained in the investigation
reported here would support Dittmann’s find-
ings and would not support Bergman. 

Gillespie (1953) approached the concept 
of resistance directly, but his results were 
equivocal. Gillespie found that Hostile Ex- 
pressions were directed toward the therapist 
or toward the therapy process following In- 
terpretation. Since his definition of interpreta- 
tion was closest to the Moderate level used 
here, Gillespie’s results conflict with the re- 
sults obtained with the Opposition category. 
These conflicting findings may be due solely 
to differences in definition and operationali-
zation of the concepts. On the other hand,
nondirective therapy, from which Gillespie’s 
work stems, is directly concerned with hostile 
expressions as indicators of resistance and as 
indicators of negative findings for the prog- 
ress of therapy in general, and Gillespie’s 
findings may reflect factors peculiar to non- 
directive procedures which were masked in 
the present study. Contrary to his expecta-
tions, Gillespie found that indices of resist-
ance other than Hostile Expressions were as-
sociated with “accurate clarification” and
“restatement of content,” which would tend
to support the results obtained with the Su-
perficial level of interpretation in the current
study.

The results of this study lend support to
Fenichel (1941) from whom the hypotheses
were drawn. Within the framework that this
theorist has presented, the findings may be
most generally stated as follows: Moderate
interpretations encourage free expression by
producing a new frame of reference or by
making new connections for materials which
are close to consciousness. Deep interpreta-
tions, if accurate, impinge on materials at
their most regressed stage, but do not stimu-
late expression because there is no direct con-
nection with the conscious ego. Conversely,
Superficial interpretations function only with 
what is already part of the conscious ego, and 
so do not provide encouragement for further 
expression. 


Summary

Three levels of interpretation, Deep, Mod-
erate, and Superficial, and two aspects of ver-
bal resistance, Opposition and Exploration,
were rated in psychotherapy protocols. The
three levels of interpretation were compared
as to the degree of resistance which followed
each level, and as to the change in degree of
resistance when the therapist shifted from
one interpretive level to another. Moderate
interpretations were more effective than were
Superficial or Deep interpretations in main-
taining lower levels of resistance and in re-
ducing levels of resistance. Reference was
made to the reliability, validity, and interrela-
tions among several categories of resistance.
Previous relevant studies and some aspects of
theoretical positions were briefly discussed.

Received April 21, 1958
WELSH’S FACTOR SCALES A AND R*
The tendency of subjects taking a person-
ality inventory to try (consciously or uncon-
sciously) to give a favorable picture of them-
selves has become a matter of increasing
concern for personality assessors, since the
appearance of the Meehl and Hathaway pa-
per (1946) on the subject. The K scale of
the MMPI was developed as an attempt to
evaluate the extent of this test-taking atti-
tude in a given subject and to numerically
“correct” those scales which seemed most
vulnerable to this type of distortion. The his-
torical antecedent of this procedure lay in
the earlier efforts of Humm and his collabo-
rators (Humm & Humm, 1944; Humm,
Storment, & Iorns, 1939) to obtain finer
empirical discriminations by a joint consid-
eration of raw score and “No-count” (num-
ber of no’s). Both of these correction pro-
cedures were made with reference to an ex-
ternal criterion, and there seems to have been
an explicit recognition that the test-taking
attitude itself may contain valid (criterion-
relevant) variance. Another avenue of ap-
proach that converges on the same problem
has been the generalization of Cronbach’s
concept of “response set” (Cronbach, 1946,
1950) to include aberrant or statistically
deviant response sets by Berg (1955, 1957)
and the subsequent implementation of this
notion by Barnes (1956a, 1956b) with the
MMPI. Starting with the simple hypothesis
that psychotics have a response bias to acqui-
esce to deviant responses and that neurotics
have a response bias to deny deviant re-
sponses, Barnes (1956b) presents evidence
that the two major factorial dimensions of
the MMPI (psychoticism and neuroticism)
have striking set properties. In addition
[...] (1957b) has pointed out that such
corrective features as the K scale and the
subtle-obvious keys of Wiener (1948) are
[...] to a response bias interpretation as
well. An important feature of the Berg–
Barnes line of reasoning is a studied lack of
regard for the specific content of any of the
items involved.

1 This investigation was supported in part by a
research grant, M-2215, from the National Institute
of Mental Health of the National Institutes of
Health, Public Health Service. Data for this study
[...] available to us.

Still a third approach is represented by
the concept of item “social desirability” that
has been the focus of a great deal of recent
research by Edwards (1957) and his follow-
ers. Although there is little that would
[conceptually] distinguish this notion from the
general “fake good” dimension that Meehl
and Hathaway had in mind, there does seem
to be an emphasis on item content that is not
present in the Berg–Barnes studies. In addi-
tion, whereas Berg and Barnes postulate two
[independent] (uncorrelated) response sets, Ed-
wards (1957) feels that “social desirability”
is “the most important single dimension on
which to locate personality statements.” Ed-
wards [maps] this dimension by having judges
rate the “social desirability” of inventory
items. When these ratings are correlated with
endorsement frequency in a heterogeneous
group, it appears that much of the item vari-
ance in the MMPI can be accounted for by
this dimension (Edwards, 1957; Fordyce,
1956; Hanley, 1956). Further, as Fordyce
(1956) has shown, when a rationally derived 
“social desirability scale” (S-D) is correlated 
with the MMPI scales, the correlation profile 
is highly similar to the distribution of load- 
ings among the clinical scales on the “psy- 
choticism” factor. 

The findings of Barnes and Fordyce with 
regard to the “psychoticism” factor of the 
MMPI are strikingly similar. It might be 
argued that Barnes compared acquiescence 
with psychoticism, while Fordyce compared 
denial (58 of the 79 S-D items are keyed 
false) with the same factor. However, Ed- 
wards (1957) explicitly denies a strict re- 
sponse set interpretation of S-D, and it might 
be the “pureness” of Barnes’ response set 
measure that resulted in the noticeably better 
“fit” with the psychotic factor loadings in his 
study. Some rapprochement is gained by 
recognizing that Barnes is speaking of “response set”
in regard to a special class of items, i.e., “deviant”
items. If one views Edwards’ social desirability
rating procedure as
a method of determining the “deviance” of 
items, then one might expect a close corre- 
spondence between statistical deviance and 
extreme social desirability ratings. 

The present paper calls attention to the 
characteristics of two MMPI scales that seem 
to have special relevance to these studies.
Welsh (1956) has reported two empirical
scales which discriminate high and low scorers
along the lines of the “psychoticism” and
“neuroticism” factors. Scale A is heavily
loaded on the psychotic factor and contains
39 items (38 of which are keyed “true”).
Scale R is heavily loaded on the neurotic factor
and contains 40 items (40 of which are
keyed “false”). Welsh labeled these dimensions
“anxiety” and “repression,” and it appears
that the A scale involves positive admissions
of psychopathology while the R scale
involves, among other things, a general denial
tendency. Welsh (1956) originally rejected a
response set interpretation of his scales by
emphasizing that their correlation is close to
zero. However, such a correlation would tend
to support Barnes’ formulations regarding the
operation of two independent response sets.



In addition to their apparent response set 
properties, Scale A and Scale R seem to represent
item pools of different social desirability
values. The items which constitute the
A scale appear to form a relatively homo- 
geneous group with respect to the dimension 
of positive statements or admissions of psy- 
chopathology. Considering the attitudes of 
our culture toward psychological illness, it 
seems reasonable to predict that the rated 
social desirability of answering “true” to such
items would be quite low. Corroborative evidence
for this impression comes from the fact
that all items are “X” items when compared
with the Minnesota normals, so that the behavior
implied by the statements is infrequently
endorsed in the general population.

The content of the R items is considerably 
more heterogeneous. Although it is difficult 
to form a consistent over-all impression of 
this group of items, it seems safe to say that, 
as a group, their rated social desirability of 
answering “true” tends to be higher than it 
would be for the A items. On the other hand,
the social desirability of answering “false”
does not appear to be as “undesirable” as answering
“true” to A items. Here it should be
noted that 15 of the 40 items are “O” items 
which are scored in the same direction as that 
answered by the Minnesota normals. 

The present study was designed to demonstrate
the properties of rational social desirability
scales made up from these two item
pools. Attention is focused on the rated social 
desirability of the two item pools and its ef- 
fect on the previously reported relationship 
between endorsement frequency and social 
desirability. In addition, the relationship be- 
tween two such social desirability scales and 
other MMPI scales is examined.


Experiment I

The 79 items from Welsh’s A and R scales 
were printed in two forms of a rating booklet 
for judges. The first booklet requested judges 
to rate each item independently on a seven-point
social desirability scale for true answers.
That is, the judges were asked to assume
that an item had been answered “true”
by a subject and to assign a scale intensity
value based on how people-in-general would
regard the social desirability of such an answer.
The second booklet was identical with
the first, except that judges were requested to
assume that each item had been answered
“false” by a subject and to assign a scale 
value based on how people-in-general would 
regard the social desirability of a “false” 
answer. 

Judges. There were 181 undergraduate men 
and women from the University of Rochester 
who served as judges of the social desirability 
of the item pool. These judges were instructed 
separately in four independent groups: (a) a 
Male-True group consisted of 55 fraternity 
men who rated the social desirability of answering
“true” to the items; (b) a Female-True
group consisted of 25 women students
from a child psychology class who rated the 
social desirability of answering “true” to the 
items; (c) a Male-False group consisted of 
67 fraternity men who rated the social desir- 
ability of answering “false” to the items; and 
(d) a Female-False group consisted of 34 
women dental hygiene students enrolled in a 
psychology class who rated the social desir- 
ability of answering “false” to the items. 

The printed instructions on the front of the 
rating booklet were read before each group, 
and questions concerning procedure were en- 
couraged. It was felt that excellent coopera- 
tion was secured and that the nature of the 
task was clear to the judges. 


Results 


A median social desirability rating on the 
seven-point scale was determined for each of 
the 79 items in the four judging groups. In 
addition, the median ratings for the Male- 
True and Male-False groups were converted 
into values on an equal-interval scale using
the method of successive intervals suggested
by Edwards (1956). The distributions of ratings
given by the True rating groups were
positively skewed, with a piling up of ratings
at the “undesirable” end of the scale. The
distributions of False ratings showed a complementary
piling up at the “desirable” end
of the scale. Although the social desirability
ratings given to the pool of items were distributed
over the seven-point scale, there was
a marked preponderance of undesirable ratings.

[Table 1. Means and Standard Deviations of Median Social
Desirability Ratings, A, R, and Total Item Pools, by rating
condition and sex of rater (Men, Women); columns Mean and SD.
Table values not recoverable.]

Table 1 shows the means and standard
deviations of the median ratings given to the
A and R item pools under each of the rating
conditions. From inspection alone it is clear
that the social desirability values of the A
items tend to be uniformly low, while the
R items are more heterogeneous and of
more moderate or “neutral” social desirability
values.

A comparison of the mean social desirability
rating given to the A items with the
mean value assigned to the R items yielded
t values of 8.3 and 7.4 for men and women
True raters, respectively, and t values of 6.8
and 6.4 for men and women False raters (77
df). For both False rating groups this difference
may be considered significant at the
.01 level. However, in the case of both True
rating groups, the hypothesis that the A and
R items share a common population variance
can be rejected at the .01 level. Thus, we find
that the R items tend to be of higher average
social desirability value than the A items, but
the reliability of this difference is indeterminate
for the True rating groups because of
the significantly greater variability in ratings
given to the R items.
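
The comparison just described can be illustrated with a brief sketch. It computes a pooled-variance t statistic for two independent groups of item ratings (39 A items and 40 R items give 39 + 40 - 2 = 77 df) and the ratio of the larger to the smaller sample variance, the quantity on which a test of common population variance rests. The function names and the example ratings are hypothetical, and the pooled-variance formula is an assumption; the report gives neither item-level data nor the exact formulas used.

```python
from statistics import mean, variance


def pooled_t(a, b):
    """Return (t, df) for two independent samples using a pooled variance."""
    n_a, n_b = len(a), len(b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / df
    standard_error = (pooled_var * (1 / n_a + 1 / n_b)) ** 0.5
    return (mean(b) - mean(a)) / standard_error, df


def variance_ratio(a, b):
    """Return the larger sample variance divided by the smaller one."""
    v_a, v_b = variance(a), variance(b)
    return max(v_a, v_b) / min(v_a, v_b)


# Hypothetical median ratings (1 = undesirable, 7 = desirable);
# these are illustrative values, not the study's data.
a_items = [1.5, 2.0, 2.0, 2.5, 1.5, 2.0]
r_items = [2.0, 4.5, 3.0, 5.5, 3.5, 4.0]

t, df = pooled_t(a_items, r_items)
ratio = variance_ratio(a_items, r_items)
print(f"t = {t:.2f} with {df} df; variance ratio = {ratio:.2f}")
```

A large variance ratio, as in this toy example, is the situation in which the pooled t becomes hard to interpret, which is the caution raised for the True rating groups.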


Experiment II 


Having determined the social desirability 
ratings for the pool of 79 items, the next step 
was the administration of these same items, 
in a conventional inventory format, to a dif- 
ferent but comparable population. A 217-item 
group short form of the MMPI was devised 
from which it was possible to score the selected
scales: L, K, Hs, D, Hy, Hy-subtle,
and Pt, as well as Welsh’s Scales A and R 
which contain the 79 items previously rated. 

Subjects. There were 379 undergraduate 
men and women from the University of
Rochester who served as volunteer subjects
for the short-form personality inventory.2
Care was taken not to include subjects who 
had previously served as judges of the social 
desirability of the item pool. There were 237 
males and 142 females in the total sample. 

2 We are grateful to Paul Obrist for his assistance
in administering and scoring the inventory.

Social desirability measures. An index of
social desirability response tendencies was obtained
for each of the 379 subjects who had
taken the short-form MMPI. By reference
to the appropriate judging group (men or
women) of Experiment I, it was possible to
assign a social desirability value for subjects’
responses to each of the 79 previously rated
items. By use of the data from both True and
False rating groups, it was also possible to
assign a value for each item, regardless of the
direction of response chosen by the subject.
In the absence of information concerning the
most efficient scoring technique, several methods
of scoring the same data were investigated:
(a) A weighted social desirability index
was obtained by summing the median
social desirability ratings for the 79 items. If
an individual answered an item in a direction
that was judged by his (her) like-sexed peers
to be socially desirable, the weight that was
added was larger than it would have been for
an item answered in the opposite direction.
Different items have different weights (median
ratings), and this index is simply the
sum of the weights. (b) An unweighted social
desirability index was obtained by adding
a 1 to the subject’s score each time an item
was answered in the direction of social desirability
and a 0 each time an item was answered
in a socially undesirable direction. The
rating reference group was the subject’s like-sexed
peers, and the decision to assign a 1 or
a 0 was based on a comparison of the median
rating for the item answered “true” and the
median rating for the item answered “false.”
(c) A transformed index was computed for
the 237 men in the sample. This is essentially
the same as the weighted index except that
the median ratings on which it is based have
been transformed to an equal-interval scale.
In addition to these three indices based on
all 79 items (Sd-T), each scoring method was
further subdivided into the 39 items from the
A scale (Sd-A) and the 40 items from the R
scale (Sd-R). There were thus nine measures
of social desirability derived from the same
data.
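
The weighted and unweighted scoring rules described above can be sketched as follows. This is a minimal illustration, not the authors' scoring procedure: the function names, the three-item example, and the treatment of tied median ratings (scored 0 here) are assumptions, since the report does not say how ties were handled. An Sd-A or Sd-R score would be obtained by passing only the items of the corresponding pool.

```python
def weighted_index(responses, true_ratings, false_ratings):
    """Sum the median rating attached to each answer the subject gave.

    responses: list of booleans, True for an item answered "true".
    true_ratings / false_ratings: median social desirability rating of
    answering "true" / "false" to each item, from like-sexed judges.
    """
    return sum(
        true_ratings[i] if answer else false_ratings[i]
        for i, answer in enumerate(responses)
    )


def unweighted_index(responses, true_ratings, false_ratings):
    """Count answers given in the direction rated as more desirable.

    Assumption: an item whose two median ratings are tied scores 0.
    """
    score = 0
    for i, answer in enumerate(responses):
        chosen = true_ratings[i] if answer else false_ratings[i]
        other = false_ratings[i] if answer else true_ratings[i]
        if chosen > other:
            score += 1
    return score


# Hypothetical three-item example (not the study's ratings).
true_ratings = [2.0, 5.5, 3.0]
false_ratings = [6.0, 2.5, 4.5]
responses = [False, True, True]

print(weighted_index(responses, true_ratings, false_ratings))
print(unweighted_index(responses, true_ratings, false_ratings))
```

In this example the first two answers fall in the more desirable direction and the third does not, so the unweighted score is 2 while the weighted score is the sum of the three ratings actually incurred.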
Results and Discussion 

Endorsement Frequency 


Item frequency counts were made of each 
of the 79 critical items in the short-form inventory.
These frequencies were tabulated
separately for the 237 men and 142 women.
It was then possible to examine the correlation
between the rated social desirability of
answering “true” to the items and the number
of subjects, in the inventory sample, who
actually did answer “true” to these items.
The relationship between the rated social desirability
of answering “false” to the items
and the number of people who answered
“false” was similarly examined. These correlation
coefficients may be found in the last
two columns of Table 1 (Experiment I).

In the total item pool, the relationship between
social desirability value and “endorsement”
(answering “true”) is of a similar order
of magnitude to that reported in other
studies (.79 for men and .76 for women). A 
similar relationship is seen in the R item pool 
(.75 for men and .71 for women). However, 
the relationship between social desirability 
and endorsement frequency found in the A 
item pool is only .41 for men and .50 for 
women. The attenuated correlations found in 
the case of the A item pool are probably re- 
lated to the decreased variability in both vari- 
ables introduced by employing an “obvious” 
item pool of uniformly low social desirability,
which leads to relatively uniform denial on
the part of the subjects. The R items, despite 
their apparent “subtlety” and moderate so- 
cial desirability values, still reflect the close 
relation between these two variables reported 
in previous group studies. 

When a pool of items of uniformly low so- 
cial desirability is selected from the psychotic 
dimension, the previously reported relationship
between endorsement frequency and social
desirability value is not found. If we
know the social desirability values of the 
items for college men, for example, we can 
account for only about 17% of the variance 
involved in endorsement. If this pool of items 
(Sd-A) is highly correlated with the MMPI 
clinical scales, it would seem that the rela- 
tionship was more a function of scoring di- 
rection (response set) than of item social de- 
sirability. 
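
The "about 17% of the variance" figure follows from squaring the correlation of .41 reported for men in the A item pool. The sketch below shows that arithmetic together with a plain Pearson correlation between rated desirability and endorsement counts. The function names and the example values are hypothetical and are not taken from the study.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mean_x, mean_y = mean(x), mean(y)
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_sq_x = sum((a - mean_x) ** 2 for a in x)
    sum_sq_y = sum((b - mean_y) ** 2 for b in y)
    return cross / sqrt(sum_sq_x * sum_sq_y)


def variance_accounted_for(r):
    """Proportion of variance in one variable predictable from the other."""
    return r ** 2


# Hypothetical item values: rated desirability of answering "true"
# and the number of subjects who answered "true".
ratings = [1.5, 2.0, 2.5, 3.0, 4.5, 5.0]
endorsements = [12, 30, 25, 60, 110, 150]
print(f"r = {pearson_r(ratings, endorsements):.2f}")

# The reported correlation for men in the A item pool.
print(f"{variance_accounted_for(0.41):.0%}")  # prints 17%
```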

The close relationship between endorsement 
frequency and social desirability in a pool of 
relatively subtle or neutral items from the 
neurotic dimension is notable, although it 
does not tend to reinforce Edwards’ position. 
He states: “Under any circumstances, it 
would seem to follow that, as the number of 
subtle or neutral items contained in a scale 
is increased, the relationship between prob- 
ability of endorsement and social desirability 
scale value would decrease” (Edwards, 1957, 
pp. 49-50). Despite this statement, it would 
seem that the demonstrated relationship be- 
tween social desirability and endorsement in 
the R item pool would necessitate a predic- 
tion of substantial correlations between social 
desirability scale Sd-R and the other scales 
of the MMPI which are presumably influ- 
enced by this variable. In correlating devi- 
ant false responses with neuroticism, Barnes 
(1956b) ignored the “O” item components
of neuroticism. The Sd-R item pool is not 
highly “deviant” in a statistical or social de- 
sirability sense and is thus not directly rele- 
vant to his work. It is more relevant to the 
general “response-bias” formulation of Fricke 
(1956, 1957a, 1957b) which seems to include 
all “false” answers—subtle, neutral, and de- 
viant. 

Studies relating “denial” frequency (answering
“false”) to independently rated social
desirability of answering False have not
been reported. In the present case, it is interesting
to note (Table 1) that these two
variables tend to be less correlated than the
complementary variables for True answers.
In Experiment I, True and False ratings for 
the same items were highly negatively corre- 
lated—although not as highly as one might 
expect. In the male rating group, for exam- 
ple, the correlation between True and False 
ratings for the total 79-item pool was -.886.
For the same male raters, the correlation in
the A item pool was -.769 and in the R item
pool -.847.


Correlations With Other Scales 


An 18-variable intercorrelation matrix was
prepared, which contained the three social desirability
indices yielded by different scoring
methods (and their A and R subdivisions)
and nine MMPI variables which could be
scored from the short form. Separate correlations
were computed for men and for women.
Although the MMPI scales cannot serve as
ultimate criteria for decisions regarding the
most efficient scoring of the social desirability
index, they do provide a rough practical guide
as to the equivalence of the three scoring
methods. The correlation of the weighted and
transformed scales, for example, is .998, and
their patterns of correlations with the other
MMPI scales are so markedly similar that
little seems to be gained by the laborious
transformation procedures suggested by Edwards
(1956) for obtaining an approximation
to an equal-interval scale. Similar findings
with respect to this transformation were reported
earlier by Hanley (1956). The unweighted
scale is also highly correlated with
the other two indices (.896 and .892 for
males), and its similar pattern of correlations
likewise throws into doubt the utility of the
more elaborate system of adding specific
weights. For economy of presentation, only
the correlations of the weighted social desirability
measure with the other MMPI scales
are presented. All findings may be generalized
to the other scoring methods.

3 A complete table of intercorrelations, as well
as the rated social desirability values of all items under
the different rating conditions, has been deposited
with the American Documentation Institute
Document No. 5774, remitting $1.25 for 35 mm [...]

[Table 2. Weighted Social Desirability Scales [...] for 237 Men
and 142 Women. Table values not recoverable.]

Table 2 [...] total 79 item [...] (Sd-T) and [...] and Sd-R) with
the [...] that could be scored in the present study. For comparative
purposes, the [...] with Edwards’ scale [...] to low social
desirability [...] the relationship between endorsement [...]
response bias interpretation (neither scoring direction exceeds
[...]). [...] related to K, moderately [...] D, Hy-sub) [...]
anxiety scales (Pt, A) that it could itself be considered such [...].
That this pattern of correlations may be accounted for by the
“anxiety-type” items seems clear from the contaminated correlations
of Sd-T and Sd-A (.95 for men and [...] for women) and by the similar
correlation pattern of the two scales.

Sd-A is also highly related to K, moderately related to the neurotic
scales, and very highly related to the anxiety scales. The
relationship between endorsement and social desirability was not high
in this pool and a response bias [...] the correlat[...] is possible
[...] Sd-A is [...] bias properties of [...]

The outstanding feature of this table is the different pattern
presented by Sd-R. Here we have a scale that cannot [...] not an
anxiety scale. On the other [...] lack of relation to Hy-sub and
[...] itself ([...] for men and [...]) suggests that [...] not open
[...] out of 40 are keyed [...] for men and women [...] could be
shown to discriminate known social desirability fakers, it would have
the advantage of being independent of response bias dimensions.

A comparison of the present correlations with those obtained by
Fordyce (1956) and Hanley (1957) suggests that the latter are more
similar to Sd-A than to Sd-R. Sd-A has only one item in common with
Ex, while S-D overlaps with Sd-A on nine items. Hanley’s
scale has interesting properties in that it contains
the small group of MMPI items that are
extreme in social desirability without being 
statistically deviant. The relationship be- 
tween endorsement and social desirability in 
this pool is the lowest reported (.27), while 
the correlations of the scale with the MMPI 
scales are substantial. Although Hanley ar- 
gues against a response set interpretation of 
his scale, he also reports a dramatic reduc- 
tion in the correlational profile of Ex with the 
MMPI scales when response bias properties 
are removed (Hanley, 1957, p. 395). 

The relationship between endorsement and 
social desirability in the Edwards S-D pool 
has not been reported. Although the items 
were not scaled for social desirability, they 
presumably represent “obvious” extremes of 
this dimension (Edwards, 1957). The possi- 
bility of response bias in this scale, and es- 
pecially in its reduced 39-item form (with 30 
items keyed False), is considered and re- 
jected by Edwards (1957). Nevertheless, the 
apparent similarity of S-D to the Sd-A of the 
present study raises questions as to the “pureness”
of S-D as a measure of “social desirability.”


Summary 


Two MMPI factor scales derived by Welsh 
(A and R) seem to have special relevance to 
studies of factors, response bias, and social 
desirability in the MMPI. The social desir- 
ability of items from these two item pools 
was rated on a seven-point scale by 181 college
students. The A items represent the “psychotic”
dimension, are keyed “true,” and tend
to be of uniformly low social desirability
value. The R items represent the “neurotic”
dimension, are keyed “false,” and are more
heterogeneous and neutral in social desirability
value.

The relationship between endorsement frequency
and social desirability in these two
item pools and its effect on rational social
desirability scales constructed from these two
pools were examined in a different population
of 379 college students who took a short-form
MMPI. Correlations of the social desirability
indices with selected MMPI scales are presented
and related to previous studies of response
bias and social desirability.

Jerry S. Wiggins and Clark Rumrill


FACTORS IN PARENTAL BEHAVIOR AND PERSONALITY
AS RELATED TO PROBLEM BEHAVIOR
IN CHILDREN

WESLEY C. BECKER, DONALD R. PETERSON, LEO A. HELLMER,
DONALD J. SHOEMAKER, AND HERBERT C. QUAY 1

University of Illinois


During the past decade there has been an 
increasing interest in developing a conceptual 
system for the analysis of parental behavior. 
In seeking this goal, a few have turned to 
factor analysis as a means of organizing ob- 
servations around a relatively small number 
of dimensional concepts. The hope is for the
development of a measurable set of dimensions
which will permit more refined studies
of the relationships between parent and child
behavior. The work of Baldwin, Kalhorn, and
Breese (1945) at the Fels Institute, Roff’s
(1949) subsequent factor analysis of the Fels
data, and the recent work by Sears, Maccoby,
and Levin (1957) have been among the more
important contributions toward achievement
of this end.

The goals of the present study are three: 
(a) to examine the factor organization of the 
Fels Parent Behavior scales as independently 
applied to mothers and fathers, (b) to investigate
possible relationships between questionnaire
personality factors and parental behavior,
and (c) to find out which aspects of
parental behavior are most closely related to
behavior disorders in children. The concern
with the first goal arises from the need for a
scheme to assess maternal and paternal behavior
independently in testing many hypotheses
about child development. The second
goal arises out of an interest in more parsimonious
means of gathering information for
the assessment of parental behavior. The step
taken in this study is considered only a beginning.
The third goal is a kind of jumping
the gun in our search for important relationships
between parent and child behavior.

1 The writers wish to express their indebtedness to
Sally Beck, Byron Lindholm, and Herbert Blaufarb,
who served as research assistants, and to the cooperating
parents and teachers from Urbana, Illinois,
as well as to the members of the Graduate Research
Board of the University of Illinois and to the
State of Illinois Department of Public Welfare for
their support of the project.


Procedure 
Subjects


The Ss consisted of two groups of families,
one with a child not in need of clinical services
(N = 25) and the other with a child in
need of clinical services (N = 32).2 The following
criteria for selection were used for
both groups: age of child between 6 and 12
years, parents living together, Caucasian extraction,
and willingness to cooperate. Since
all but two families who were asked to participate
agreed to do so, cooperativeness did
not act as a selective factor. Families were
admitted to the clinic group if they met the
above criteria and if they were seeking help
for a problem with their child other than one
associated with brain damage or intellectual
deficit. Families were admitted to the nonclinic
group on the recommendation of teachers
who were instructed to select children who
did not need special psychological services.3

Table 1 presents the means and standard
deviations for several sample characteristics.

2 Complete data were available for only [30] fathers
of the clinic group. Therefore the N is two less than
for mothers. This accounts for minor discrepancies in
r’s between the same variables in the mother and
father matrices.

3 The aim was to maximize child adjustment variance.
Since separate criteria of adjustment are present
in the study, the accuracy of teacher judgment is
not at issue.

Table 1

Sample Characteristics

                                 Clinic Group     Nonclinic Group
Variable                         Mean    Sigma    Mean    Sigma

Age of child                     8.7
Child IQ                         107.0
Socioeconomic status (Warner)
Age of mother
Mother IQ
Mother education
Age of father
Father IQ
Father education

[Only the first value in each of the first two rows survives;
the remaining table values are not recoverable.]

As one might expect in a university town, the
average education and intelligence of these 
groups are above national norms. The only 
important sampling imbalance to arise was on 
sex of child. In the clinic group there were 
24 boys and 8 girls. In the nonclinic group 
there were 13 boys and 12 girls. 


Collection of Data 


The present analysis covers ratings made 
from interviews with parents, scores on the 
Guilford-Martin personality inventories, IQ 
and socioeconomic status estimates, and other 
background variables. All data but the inter- 
view ratings were gathered in accordance with 
standard procedures, as will be clear from 
the description of variables given below. The 
ratings were obtained as follows: 

Two rating schedules were completed from 
the interview. The first consisted of 17 of 
the 30 Fels Parent Behavior scales. These 17 
scales were selected on the basis of Roff’s 
(1949) factor analysis in such a way that 
each factor had at least two rating scales 
loading highly (all but one over .50) on the 
factor. A detailed description of these scales 
can be found in Baldwin et al. (1945). 
The only modification of the scales involved 
changing the word home to mother or father 
on a few of the scales. For example, if the 
scale read “adjustment of home” it was 
changed to “adjustment of mother,” or to 
“adjustment of father.” The second rating 
schedule was derived from Himmelweit’s 
(1952) analysis of Ackerson’s (1942) case
records on 3,000 problem children. This
schedule required three-point ratings on eight
behavior characteristics loading on a personality
problem factor (sensitivity, absent-mindedness,
seclusiveness, day-dreaming, inefficiency
in work, inferiority feelings, changeability
of moods, and nervousness) and 11
behavior characteristics loading on a conduct
problem factor (truancy from home, truancy
from school, stealing, fighting, lying, destructiveness,
swearing, disobedience, rudeness,
selfishness, temper tantrums). Only the
summed scores for personality problem and
conduct problem were used in the analysis.
Two additional child behavior variables were
also rated from the mother interviews,
namely, child school progress (retarded, average,
advanced) and child social development
(problem in peer relations, average peer relations,
exceptionally good peer relations).

Since many of the parents had entered a 
clinic to deal with their problem children, in- 
terviews were not completely structured. In- 
stead, a list of topics to be covered was drawn 
up, and the interviewer was left free to cover 
the topics in the most convenient order. This 
procedure is not unlike that used at the Fels 
Institute, except that repeated interviews were 
not possible. Four staff psychologists and one 
advanced clinical psychology student served 
as interviewers. Ratings were made immediately
following the interviews. In no case did
the same interviewer see both parents or have
outside knowledge of the child’s behavior.


Rating Reliabilities


The design of this study did not permit a
check of two-rater reliability. However, Baldwin
has presented detailed evidence that the
Fels scales are sufficiently well defined to produce
two-rater reliabilities in the .50 to .75
range. The writers’ use of the Fels scales differed
in two ways from their use at the Fels
Institute. First, ratings in this study were
based on an interview in the clinic rather
than on a home interview and observation period.
Second, mothers and fathers were rated
separately rather than as a composite. It is
not known how these changes affect reliabili- 
ties. 

The training of raters was accomplished by 
having all raters evaluate the same recorded
interviews during the preliminary phase of
the study. Through discussion, the structure 
of the interview and problems in rating were 
clarified. About midway through the study, a 
case was selected at random to check rater 
agreement. Using five- to seven-point scales, 
the percentage of agreement scores between 
raters showed 60% perfect agreement, 24% 
with a deviation of one, and 16% with a 
deviation greater than one. 
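
The agreement breakdown reported above (perfect agreement, a deviation of one, a deviation greater than one) can be tallied as in the sketch below. The report does not say how the five raters were paired for this check, so the sketch assumes a single pair of raters scoring the same scales; the function name and the example ratings are hypothetical.

```python
def agreement_breakdown(ratings_a, ratings_b):
    """Percent of scales on which two raters agree exactly,
    differ by one scale point, or differ by more than one point."""
    differences = [abs(a - b) for a, b in zip(ratings_a, ratings_b)]
    n = len(differences)
    perfect = sum(d == 0 for d in differences)
    off_by_one = sum(d == 1 for d in differences)
    return {
        "perfect": 100 * perfect / n,
        "one": 100 * off_by_one / n,
        "more": 100 * (n - perfect - off_by_one) / n,
    }


# Hypothetical ratings by two raters on five scales.
rater_a = [3, 4, 5, 2, 6]
rater_b = [3, 4, 4, 2, 3]
print(agreement_breakdown(rater_a, rater_b))
```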

Because the final data (correlations) are 
based on scores from five different raters, it 
seems unlikely that individual rater biases 
have systematically affected the findings. A 
possible source of systematic bias, however, 
comes from the inability to prevent inter- 
viewers from knowing whether or not they 
were dealing with a clinic or nonclinic parent. 
The parents had to talk about their children, 
and this made it impossible to hide their 
group affiliation. While all raters made a con- 
scious attempt to avoid such a bias, this must 
remain an unresolved problem. 

Separate estimates of the degree of child 
personality problem and child conduct prob- 
lem were obtained from mother and father. 
Interparental agreement was .71 for the former
and .85 for the latter.


Method of Analysis


Two parallel matrices were formed, one for
mothers and one for fathers. Pearson r’s were
computed and 12 centroid factors extracted
using unit communalities (fixed). Ten
factors from each analysis were rotated by
an electronic computer to orthogonal simple
structure using the quartimax routine (Neuhaus
& Wrigley, 1954).4 The factors are
presented as rotated by this mathematical
procedure.

4 An orthogonal solution was employed in order to
show the relationships between child behavior and
parent behavior in the clearest fashion. If the goal
were only one of defining factors in parent behavior,
an oblique solution might have been preferred. A
real possibility in an oblique solution, however, is
that child behavior would come out as separate factors
and parent behavior as separate factors, and
their interrelations would be hidden in higher-order
factors. Thus a complete higher-order solution would
have been required to bring out the essential relationships
which are just as easily (and more economically)
demonstrated with an orthogonal solution.

Inspection of factor plots indicates that the obtained
solution would not be appreciably modified


Description of Variables 


The 46 variables entering each factor analy- 
sis are described below. The direction of a 
high score on each variable is indicated fol- 
lowing the titles. 


1. Age of parent: older 
2. Age of child: older 
3. Sc« of child: male 
. Number of children in family 
. Martial change on part of 
marriage): none 
6. Parent IQ estimate (WAIS 
prehension, Vocabulary): high 
7. Child IQ estimate (WISC 
prehension, Vocabulary): high 
8. Socioeconomic status (Warner scale): low 
9. Education of parent: high 
10. Guilford S: shy 
11. Guilford T: introspective 
12. Guilford D: depressed 
13. Guilford 
14. Guilford R 
15. Guilford G 
16. Guilford A: 
17. Guilford M 
18. Guilford I 
confident) 
19. Guilford N: not nervous 
0. Guilford O: o 
. Guilford Ag 
Guilford Co: cooperative 
Clinic-nonclinic group: nonclinic 
24. Fels, adjustment of parent: maladjusted 
5 inactive 
unsociabl 


more 
parent (previous 
Information, Com- 


Information, Com 


emotionally unstable 

rhathymic 

active 

socially ascendent 

masculine attitudes and interests 


lack of inferiority feelings (self 


ctive 
agreeable 


Fels, activeness of parent 
26. Fels, sociability of parent 
27. Fels, coordination of parent: uncoordinated 
28. Fels, emotionality toward child 


(objec tive) 


nonemotional 


29. Fels, readiness of enforcement: lax 

30. Fels, severity of penalties: mild 

31. Fels, justification of disciplinary policy 
trary 


arbi 


by an oblique solution. Hyperplane counts could be 
improved, but the same factors would probably re- 
main with the same relative order of importance of 
variables loading on them. The fact that the ob 
tained results parallel closely many of Roff’s oblique 
factors adds confidence to the above conclusion 

The reader may legitimately ask why simple cor 
relational analysis would not deal with 
parent-child interrelations. The answer is that the 
factor analysis allows one to see constellations of re- 
iationships which aid in interpretation of the cor 
relations and at the same time allows for the con 
trol of the effects of other factors in the analysis 
(suppressors and inflators) in interpreting relation- 
ships. 


suffice to 
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32. Fels, 
33. Fels, 
(optional) 
34. Fels, 
35. Fels, 
36. Fels, 
osity 


Variable 


OM Se wre 


‘ 
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quantity of suggestion: nonsuggesting 37. Fels, 


coerciveness of suggestion: noncoercive posing) 


38. Fels, 


duration of contact with child: brief 39. Fels, 


democracy of policy: dictatorial effective 
readiness of explanation: thwarts curi- 40. Fels 
cordant) 


Table 2 


Quartimax Factor Load 


general protectiveness: nonprotective (ex- 


solicitous for child’s welfare: nonsolicitous 


effectiveness of disciplinary 


disciplinary friction: nonfriction 


Mother Factor 


policy : 


»o= 
= 


“rn unt Ww 


~3 =F GD = GD 


a 


in 


(con- 
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Table 2—Continued 
Father Factors 


Variable 


41. Child conduct problem, mother informant Results 
problem 4 i . . 
42. Child conduct problem, father informant rhe results of the quartimax solutions for 
problem mothers and for fathers are presented in 
43. Child personality problem, mother informant: Table 2: 


The factors have been ordered so 
problem 


44. Child personality problem, father informant 5 The correlation matrices have been deposited 


problem with the American Documentation Institute. Order 
45. Child school progress: advanced Document No. 5781 from ADI Auxiliary Publica- 
46. Child social development: advanced tions Project, Photoduplication Service, Library of 
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Table 3 


Factor Pair I 


Variable 


41. Child conduct problem, mother informant 
42. Child conduct problem, father informant 
39. Fels, ineffective disciplinary policy 
40. Fels, much disciplinary friction 
28. Fels, emotional toward child 
24. Fels, maladjusted 
23. Clinic family 
2. Younger child 
46. Retarded social development 
27. Fels, uncoordinated parent 
31. Fels, arbitrary disciplinary policy 
1. Younger parent 
25. Fels, active parent 


36. Fels, nonreadiness oi explanation (thwartir 


32. Fels, much suggesting 

35. Fels, dictatorial policy 

38. Fels, solicitous for child’s welfar 
21. Guilford Ag, hostile 

29. Fels, lax enforcement 

15. Guilford G, active 

17. Guilford M, masculine 


* The values from Roff are loadings 
siveness of children in nursery school 


that similar factors from the mother and 
father solutions have the same factor number. 
Very close matches are apparent for Factors 
I to VII. The reader should note that this 
has been achieved by “blind,” analytic rota- 
tion methods. Part of the reason for success- 
ful matches derives from the fact that 11 of 
the 46 variables are based on the same data 
in each analysis (Variables 2, 3, 4, 7, 8, and 
40-46). This overlap particularly affects the 
first two pairs of factors, though similar fac- 
tors would likely have emerged even without 
overlap. 

As a partial check on linearity of relation- 
ships, r’s were computed separately for clinic 
and nonclinic groups. Nonlinearity of regres- 
sion should be indicated by a positive r in 
one group and a negative 7 in the other (pro- 
viding that the groups differ in means on one 
or both variables). By and large, within 
group r’s showed consistent patterns. The 
main area of deviation was in the relationship 
of permissiveness variables (see Table 8) to 


Congress, Washington 25, D. C., remitting $1.25 for 
microfilm or $1.25 for photocopies 


General Family Maladjustment 


Mother 
Loadings 


Father 


Loadings 


Meyer* 


84 
84 
79 
79 
51 


parent-child harmony variables (see Table 
3). For example, in the mother analysis mild- 
ness of penalties correlates —.47 with conduct 
problem in the nonclinic group and +.41 in 
the clinic group. The reader should keep in 
mind that only consistent linear relationships 
are revealed in a factor analysis. Other im- 
portant relationships may be obscured. 
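
The quartimax rotation behind the solutions reported in this section can be illustrated with a short modern sketch. It is not the Neuhaus and Wrigley routine itself: it uses a generic SVD-based iteration that seeks an orthogonal rotation maximizing the sum of fourth powers of the loadings, and the function name, tolerance, and iteration limit are assumptions chosen for illustration.

```python
import numpy as np

def quartimax(loadings, max_iter=500, tol=1e-10):
    """Orthogonally rotate a (variables x factors) loading matrix.

    Illustrative sketch only: an SVD-based iteration that increases the
    quartimax criterion (sum of fourth powers of the loadings).
    Returns the rotated loadings and the rotation matrix.
    """
    A = np.asarray(loadings, dtype=float)
    R = np.eye(A.shape[1])              # start from the unrotated solution
    crit = np.sum(A ** 4)
    for _ in range(max_iter):
        L = A @ R
        # Gradient of the criterion (up to a constant) is L**3;
        # the nearest orthogonal matrix to A'L^3 comes from its SVD.
        u, _, vt = np.linalg.svd(A.T @ L ** 3)
        R_new = u @ vt
        crit_new = np.sum((A @ R_new) ** 4)
        if crit_new - crit <= tol:      # stop when no further gain
            break
        R, crit = R_new, crit_new
    return A @ R, R
```

Because the rotation is orthogonal, each variable's communality (row sum of squared loadings) is unchanged; only the distribution of variance across factors is altered.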


Discussion 


The results will be discussed with respect 
to the three questions posed earlier, but in a 
modified order. To aid the reader in following 
the discussion, the more important factor 
pairs will be presented in separate tables. 
Variables loading over .30 will be described 
so that a positive factor loading coincides with 
the description given. 
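
The tabling convention just described (keep variables loading over .30 and word each label so that the loading reads as positive) can be sketched as follows. The function name and the example loadings in the usage note are hypothetical and are not values from this study.

```python
def describe_salient(loadings, high_labels, low_labels, cutoff=0.30):
    """List variables whose absolute loading exceeds the cutoff.

    For a negative loading the label for the low pole is used, so every
    reported loading is positive. Sorted from largest to smallest.
    """
    rows = []
    for loading, high, low in zip(loadings, high_labels, low_labels):
        if abs(loading) <= cutoff:
            continue                      # not salient, omit from table
        label = high if loading > 0 else low
        rows.append((label, abs(loading)))
    return sorted(rows, key=lambda row: -row[1])
```

For instance, with made-up loadings of .84, -.45, and .10 on "Child conduct problem", "Older child", and "Male child", the second variable would be listed as "Younger child" with a loading of .45 and the third would be dropped.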

Relationships Between Parent and Child Be- 
havior 


Relationships between parent and child be- 
havior appear primarily on Factor Pairs I and 
II. Factor Pair I is presented in Table 3 
along with Roff’s parent-child harmony factor
and Meyer’s (1947) correlations of “parent
behavior” with the degree of aggressive behavior
shown by children in nursery school.
A good match is apparent between mother
and father Factor I, collectively described as
general family maladjustment. The results
indicate that Roff’s parent-child harmony factor
coincides with Himmelweit’s child conduct
problem factor and repeat Meyer’s findings
concerning child aggressiveness and parent
behavior.


There are a number of interesting hypotheses which may be deduced
concerning family relationships fostering conduct problems in
children. Diagrammatically, the following relationships are suggested:

[Diagram not recoverable.]

In their recent book, Sears et al. (1957) point out that mothers who
were very permissive with regard to aggressive acts, but also very
punitive and frustrating, would tend to have the most aggressive
children. The relationships indicated above suggest another
possibility. Where one parent is punitive and frustrating toward the
child (instigating aggression) and the other parent is lax about
discipline, conditions are right for the development of an aggressive
child. A number of other possibilities are suggested. Both parents
give vent to emotions easily, providing a model of uncontrolled
behavior. Parental conflict seems an inescapable deduction. Displaced
expressions of this conflict in interaction with the child are
possible. Conflicting reinforcement of child behavior by each parent
is very likely.

The most interesting finding to emerge from this analysis is presented
in Table 4, which summarizes the loadings in Factor Pair II. The
loadings of the first five variables indicate that the same factor has
been extracted from each matrix, a factor centering on personality
problem in the child. However, when a comparison is made of parental
behavior variables loading on the factor, four associations above .30
are found with father behavior and none with mother behavior. The most
marked parental differences are seen on the maladjustment rating and
the nonreadiness of explanation (thwarting) rating. At face value,
these findings suggest the hypothesis that the adjustment of father is
more critical in determining personality problems in children than is
the adjustment of mother. However, even though sex of child does not
load on this factor, the high weighting of boys over girls in the
clinic sample may be important in evaluating this finding. The mother
results cannot be attributed to attenuated ranges for mothers on these
variables. Note the failure of protectiveness variables to load on
this factor.


Table 4

Factor Pair II: Child Personality Problem

[Table body not recoverable. Surviving row labels include child school
progress, child social development, and Fels nonreadiness of
explanation.]

Standard practice in psychiatry and clinical
psychology for years has been to see mother
and child in therapy in problem-child cases.
Usually this practice has been defended on
the assumption that the mother relationship is
more important in child development. The
above findings suggest a need to re-evaluate
this assumption and to explore in more detail
the role of the father in child development.6

There are two other factor pairs where 
both parent and child behavior loadings are 
found. The loadings on Factor Pair IV (see 
Table 5) suggest that both parents are more 
protective of younger children and of children 
with lower IQs, while the loadings on Factor 
Pair V (see Table 6) indicate that higher 
intelligence in the child and advanced school 
progress are related to democratic attitudes 
on the part of the mother. This latter finding 
is of considerable interest, but of course does 
not indicate a causal relationship. A best guess 
would be that causality is operating in both 
directions. A mother can be more democratic 
with a bright child, and having a democratic 
mother is a positive stimulus for the growth 
of intelligence. In Roff’s analysis, parental 
encouragement of intellectual skills, “accelera- 
tional attempt,” loaded on the democracy fac- 
tor. Elsewhere Baldwin et al. (1945) demon- 
strated significant increases in IQ over a 
three-year period by children of high demo- 
cratic mothers in comparison with children of 
low democratic mothers. 


Comparison of Mother and Father Behavior 
Patterns 


One of the important methodological questions this study sought to
answer was the degree of applicability of Roff’s factor analysis of
the Fels scales when mother and father behavior was independently
assessed. Five of the nine interpretable factor pairs can be related
in some way to the seven factors derived by Roff. Factor Pair I,
corresponding to Roff’s parent-child harmony factor, has already been
presented in Table 3. A good match was found in this case between
mother and father behavior patterns. Also it has been indicated that
this factor ties in with conduct problems in the child and
maladjustment in the parents.

6 It is of interest to note that the father groups also show more
significant differences in means on the Fels scales than do the mother
groups. In addition, a preliminary analysis of items from a parental
attitude inventory developed by Peterson indicated greater differences
between fathers than mothers when clinic and nonclinic groups are
compared.

Of the remaining four factor pairs, two show a good match between
mothers and fathers, and two do not. Each of these four factor pairs
will be discussed in turn.

Factor Pair IV (Table 5) represents a general protectiveness factor
called concern for the child by Roff. There is a close repetition of
Roff’s patterns for both mothers and fathers. The major deviation from
Roff’s loadings is on the duration of contact variable. A possible
explanation for this difference lies in the fact that Roff’s analysis
was on parents of preschool children, while the current study involves
parents of grammar school children. Perhaps the amount of contact with
child is only associated with protectiveness before society places
many demands on a child’s time. Baldwin (1955) has previously
indicated several important relationships between protectiveness and
child behavior.

Table 5

Factor Pair IV: Protectiveness (Roff’s Concern for Child)

[Table body not recoverable. Surviving row fragments refer to parent
coordination, sex of child (female), number of children in family,
parent unsociability, and extensive contact with child.]


Table 6

Factor Pair V: Democracy of Policy

Columns: Variable; Mother Loadings; Father Loadings; Roff

Fels, democratic policy (partially legible values: 7?, 62)
Fels, extensive contact with child
Fels, rational disciplinary policy
Fels, little suggesting
Fels, readiness of explanation (satisfies curiosity)
Fels, [...] toward child
Child IQ high
Guilford M, [...] of interests
Guilford G
Child school [...]
Fels, [...] of suggestion
Fels, lax enforcement

[Remaining loadings not recoverable.]


Mother and Father Factors V (Table 6) show a close match to Roff’s
democracy factor. While there is considerable agreement between mother
and father factor loadings, there are also important differences. A
democratic approach to child rearing in the mother shows relationships
with inactivity and masculinity of interests.7 Democracy in the father
takes on more of a permissiveness flavor than in the mother. For most
purposes, it would appear feasible to use the same procedures in
evaluating this factor for mothers and fathers.

A very poor match is found between Mother and Father Factor VIII
(Table 7). Mother Factor VIII contains the typical extraversion
components (R, S, G, A) and seems to be particularly characterized by
sociability on the Fels scales. Father Factor VIII is primarily
described by general activity.

It is quite possible that the same factor is being assessed in each
case, namely, extraversion, but that in our culture male extraversion
is revealed behaviorally by activity (task orientation) and female
extraversion by sociability. In Roff’s analysis of the Fels scales, a
sociability-adjustment factor and an activity factor were found.
However, Roff rated activeness of home, sociability of family,
adjustment of home, and coordination of household. There was no
attempt to separate the behavior of mother and father. Roff’s
activeness factor appears to tie in with Father Factor VIII, and the
sociability component of Roff’s sociability-adjustment factor with
Mother Factor VIII. These findings clearly indicate a need for
separate scaling procedures for evaluating maternal and paternal
behavior in the extraversion area.

7 Item analysis indicates that this association is mainly due to such
items as “Do odors of perspiration disgust you?” or “Are you disgusted
at the sight of ragged or soiled fingernails?” High democratic mothers
more frequently answer these questions “no.”

Table 7

Mother Factor VIII: Guilford [...]

Columns: Variables; Mother Loadings

14. Guilford R, rhathymic
10. Guilford S
Guilford A, socially ascendant
Fels, sociable
Guilford G
Guilford T, [...]spective
Guilford D, [...] depression
Guilford I

Father Factor VIII

Fels, active parent
Fels, coordinated parent
Guilford G, active
Fels, strict enforcement
Guilford A, socially ascendant
Guilford R, rhathymic
Younger child
Previous marriage by father

[Loadings not recoverable.]


Table 8

Mother Factor IX: Permissiveness

Columns: Variables; Mother Loadings; Roff

29. Fels, lax enforcement
30. Fels, mild penalties
33. Fels, noncoercive use of suggestion
32. Fels, low quantity of suggestion
31. Fels, rational disciplinary policy
28. Fels, nonemotional toward child
37. Fels, nonprotective
25. Fels, inactive
24. Fels, adjusted

Father Factor IX: Warm Extraversion versus Hostile Withdrawal

30. Fels, mild penalties
26. Fels, sociable
14. Guilford R, rhathymic
10. Guilford S, sociable
29. Fels, lax enforcement

[Loadings not recoverable.]


The final factor pair defined mainly by the Fels scales is presented
in Table 8. Here again the mother-father match is poor. Mother Factor
IX is clearly a permissiveness factor similar to that reported by Roff
except that the suggestion scales are included in the present factor
and were not in Roff’s.8 It is apparent that Father Factor IX also
leans toward the permissiveness realm but has an essentially different
composition. If one looks at the opposite poles from those given in
Table 8, a certain familiar ring is detected. The negative pattern
involves severe penalties (hostility), reclusiveness, inhibited
disposition, and vigilance, in the sense of looking for trouble. This
is suggestive of the schizoid personality.

The importance of the need to demonstrate (not assume) that factor
patterns derived from the analysis of mother behavior or “parent
behavior” are applicable to father behavior is made especially clear
in the following example. Assume, as the writers had for part of their
analysis, that Roff’s analysis (see Table 8) had been used to define a
permissiveness factor for fathers. The best estimate of this factor
would be a weighted combination of readiness of enforcement (Variable
29) and severity of penalties (Variable 30). For clinic fathers
permissiveness, so defined, is found to correlate +.37 with
disciplinary friction, and for the nonclinic fathers the correlation
is −.48. If the components of permissiveness are examined separately,
it is found that the source of the association for clinic fathers is
an r of +.47 between nonenforcement and disciplinary friction, while
the source of the association for the nonclinic fathers is an r of
−.55 between mild penalties and disciplinary friction. An assumption
was made that a factor found with mothers would hold for fathers and
it did not. For similar reasons the reader is cautioned not to assume
generalizability of the factor patterns reported herein to
non-middle-class families.

8 The instructions for the quantity of suggestion scale indicate that
the rater should disregard the amount of control exercised in
disciplinary situations. This is less easily done in a clinic
interview session than in a home observation. This shift in rating
conditions is a possible basis for the difference in factor
composition.
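
The within-group check used in the example above (correlating the same two variables separately in the clinic and nonclinic groups and looking for a change of sign) can be sketched as below. The function names are assumptions made for illustration, and the study's raw scores are not available here, so any data passed in would be the reader's own.

```python
import numpy as np

def within_group_r(x, y, group):
    """Pearson r between x and y, computed separately for each group."""
    x, y, group = np.asarray(x, float), np.asarray(y, float), np.asarray(group)
    return {
        str(g): float(np.corrcoef(x[group == g], y[group == g])[0, 1])
        for g in np.unique(group)
    }

def sign_reversal(rs):
    """True when at least one group r is negative and another is positive."""
    values = list(rs.values())
    return min(values) < 0 < max(values)
```

A reversal such as +.37 for clinic fathers against −.48 for nonclinic fathers would be flagged by `sign_reversal`, which is the pattern the text treats as a warning that a pooled linear factor analysis may obscure the relationship.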


Questionnaire Factors and Parent Behavior 


The final question to be evaluated is the relationship of
questionnaire personality factors to parent behavior factors. In
general the relationships are not high. Mother Factors I, IV, and V
show a few low to moderate loadings of Guilford factors, and Mother
Factor II, comprising primarily Guilford scores, shows a few loadings
of parent behavior variables. Father Factors I, VII, and IX also show
low to moderate loadings of Guilford factors. At present one might
recommend the substitution of Guilford G for ratings of activity of
fathers and Guilford R and S for ratings of sociability of mother.
Other components of parental behavior will require special efforts at
questionnaire development.

As Table 9 illustrates, there is a high cohesiveness of the Guilford
factor scores. Eleven of the 13 Guilford factor scores show
appreciable loadings on what appears to be a
self-perception-of-adjustment factor, or simply Guilford adjustment.
This factor parallels the second-order emotional stability or
neuroticism factor discussed by Eysenck (1953). On the basis of an
unpublished study by Becker, this factor should also correlate highly
with Cattell’s second-order factor called anxiety versus dynamic
integration. Studies by Farber, Spence, and Bechtoldt (1957) suggest
that the Taylor Manifest Anxiety Scale would also load highly on this
factor. Considering the current interest in, and claims for, anxiety
scales, it is important to note the lack of association of this factor
with problems in children and the relatively small associations with
parent behavior ratings. From the present findings one must conclude
that the domain of personality being tapped by the Guilford
inventories is quite limited.


Table 9

Factor Pair III: Guilford Adjustment

[Table body not recoverable.]


The Remaining Factors 


The three factor pairs which have not yet entered the discussion
deserve brief mention. No attempt has been made to interpret the last
factor in each analysis. Factor Pair VII is of minor importance,
loading mainly on the Guilford agreeableness and cooperativeness
scales. Factor Pair VI repeats closely in both mother and father
analyses and is defined by parental education, IQ, and socioeconomic
status. It is interesting that the only child variable to load on this
factor, besides child IQ, is school progress. This result suggests
that while a minimum parental intelligence is necessary to do a good
job as parents, more intelligence does not necessarily lead to better
family adjustment. The generally accepted principle of the prepotency
of feeling over reasoning is consistent with this result.


Summary 


This study sought to examine the factor 
organization of the Fels Parent Behavior 
scales independently for mothers and fathers, 
to investigate relationships between question- 
naire personality factors and parental behav- 
ior, and to find out which aspects of parental 
behavior are most closely related to behavior 
disorders in children. 

Two groups of families, one with a child not in need of clinical
services (N = 25) and the other with a child in need of clinical
services (N = 32), were examined. Parents were individually given the
Guilford-Martin personality inventories, an IQ measure, and an
interview which permitted rating of 17 Fels Parent Behavior scales and
of the kinds of problem behavior displayed by their children. Two
independent factor analyses were carried out to orthogonal simple
structure, one with 46 mother-and-child variables and one with 46
father-and-child variables. Nine factors in each analysis were
interpreted.

Child behavior problems loaded primarily on two factors. Conduct
problems (aggressive, uncontrollable) in the child coincided with
Roff’s parent-child harmony factor in both the mother and the father
analyses. The pattern of loadings indicated that in families with
conduct problem children, both parents are maladjusted, give vent to
unbridled emotions, and tend to be arbitrary with the child. In
addition, the mother tended to be active (tense), dictatorial,
thwarting, and suggesting, whereas the father tended not to enforce
regulations. On the other hand, a factor defined mainly by personality
problems in the child (shy, sensitive, inferior) showed associations
only with father behavior ratings. The father was rated as maladjusted
and thwarting of the child. The many important associations between
father and child behavior lead to the conclusion that future research,
and perhaps therapeutic practice, should give more consideration to
the role of the father in child development.

The mother analysis repeated five of Roff’s
seven parent behavior factors, namely: parent-child
harmony, protectiveness (concern), democracy,
permissiveness, and sociability (Roff’s
sociability-adjustment). Roff’s activeness of
home appeared only as activeness of the 
father. The father analysis showed parallel 
factors for parent-child harmony, protective- 
ness, and democracy, but a factor designated 
general activity and a factor designated 
warm-extraversion versus hostile-withdrawal 
appeared in place of sociability and permis- 
siveness. These findings suggest differences in 
the organization of maternal and paternal be- 
havior which measurement procedure must 
take into account. 

In general, the Guilford questionnaire fac- 
tors showed little association with parent be- 
havior ratings, the major exceptions being on 
factors called sociability and activeness. A 
number of suggestive correlations are present, 
however, which might help in devising ques- 
tionnaires for assessing parent behavior. 
PARANOID SCHIZOPHRENIC AND NORMAL
SUBJECTS’ PERCEPTIONS OF PHOTOGRAPHS
OF HUMAN FACES 1


CARROLL E. IZARD


Vanderbilt University


The purpose of this study was twofold: to examine the differences
between paranoid schizophrenic and normal subjects (Ss) on the basis
of their reported perceptions of pictures of normal adult faces and to
examine more closely the usefulness of such pictures as a diagnostic
device and as a method of studying personality. Since Freud’s early
work (1938) showing the relationship between needs (impulses) and
projections and his original definition of projection as a determinant
of or tantamount to perception, there has been a great deal of
clinical and research evidence pointing to causal interrelations among
needs, perceptions, and behavior (Bruner & Postman, 1947, 1948; Cowen
& Beier, 1950; McCleary & Lazarus, 1949; McGinnies, 1949; McGinnies &
Bowles, 1949; Rogers, 1951; Snygg & Combs, 1949). The effectiveness of
pictures of people as a method of studying these aspects of
personality has been demonstrated in many studies. Murray’s classic in
this area, published in 1933, showed the effect of situationally
induced fear on children’s perceptions of pictures. In 1943, Murray
published the Thematic Apperception Test, which rapidly achieved the
status of model or standard for picture-story techniques. Since that
time, researchers have introduced variations in the kind of pictures
used as stimuli, in the sort of responses called for, in the testing
conditions, and in the kind of Ss selected for study (Bell, 1948;
McClelland, Clark, Roby, & Atkinson, 1949). The variation which
probably attracted more attention than any other was made by Szondi
and introduced in this country by Deri (1949). Szondi featured the
face of a mental patient in each of his pictures and assumed a
relationship between S’s response and the diagnostic classification of
the person pictured. Szondi’s assumptions have been discounted in
several studies (Davidson, Murphy, & Newton, 1949; Guertin, 1950;
Holt, 1949; Klopfer & Borstelmann, 1950), but his method succeeded in
drawing more attention to pictures of the human face as stimuli
facilitating the study of social perception (Secord, Dukes, & [...],
1954; [...] Muthard, 1955).

The technique used in this study is like the “standard” (TAT) in that
it utilizes pictures of people. It differs from the TAT in two
important ways: each stimulus card features the face of a single
normal adult in an ordinary pose, and the response called for
emphasizes straight reporting of perceptions rather than imagination
and fantasy. The pictures used in this study are like Szondi’s in
featuring the human face but unlike his in that the people
photographed are normal adults.

1 This paper is based on a dissertation presented to the graduate
school of Syracuse University in 1952, in partial fulfillment of the
requirements for the Ph.D. A summary of the dissertation was read at
the meeting of the American Psychological Association in 1953.

The theoretical formulation which furnishes
the framework for this study maintains that
there are causal interrelationships among
needs, perceptions, and behavior. In keeping
with current research and clinical practice
(Bellak, 1944; Klopfer, 1949; Sargent, 1945;
Shneidman, 1951), it was assumed that perceptions
of human photographs can lead to
valid inferences regarding day-to-day social
perceptions and behavior.

The general hypothesis was that groups
which differ widely in terms of observable 
behavior and inferred needs, namely, para- 
noid schizophrenics and normals, would differ 
in their perceptions of pictures of people. In 
particular, four hypotheses were defined for 
testing: 


1. Judges’ ratings of the favorableness of 
feelings or attitudes expressed in the verbal- 
ized responses will be significantly lower for 
the paranoid schizophrenics than for the 
normals. 

2. Paranoid schizophrenic Ss will give a 
greater number of responses that will be 
scored by the judges as internally inconsistent 
or ambivalent. 

3. Judges trained in psychology, psychia- 
try, or social work will be able to utilize the 
protocols of verbalized responses for distin- 
guishing between paranoid schizophrenics and 
normals. 

4. The paranoid schizophrenic and normal 
groups can be differentiated on the basis of 
their preferences for the experimental pictures. 


Procedure 


Subjects. The Ss were 30 paranoid schizo- 
phrenics and 30 normals. The schizophrenics 
were selected from patients recently admitted 
to the Veterans Administration hospital where 
the research was conducted. Patients selected 
as Ss had been independently diagnosed as 
paranoid schizophrenics by the psychological 
staff and by the psychiatric staff. Their mean 
age and educational level were 31.0 and 11.4, 
respectively. The normal Ss were volunteers 
from the hospital employees and from a uni- 
versity extension course in psychology. Their 
mean age and educational level were 32.9 and 
12.7, respectively. The schizophrenic and nor- 
mal Ss were approximately matched in terms 
of occupational level. 

Construction of the experimental instru- 
ment. To obtain a pool of photographs, a 
story on the proposed project was published 
by the newspapers of the central New York 
city where the study was initiated. The story 
indicated that a photographer would be sta- 
tioned at a convenient downtown location and 
requested people to contribute a photograph 
for research purposes. In this way, approximately
1,000 photographs were obtained. The
pictures covered a wide range of ages and 
socioeconomic classes. The sample included 
professional and business people, employees 
of nearby stores, shoppers, and students. For 
the purposes of this study the sample was con- 
sidered sufficiently representative of the nor- 
mal adult population. For each sex, four 
groups of eight pictures were selected at ran- 
dom. Ten normal people ranked each of these 
groups of eight in order of preference. The 
eight pictures of each sex whose mean prefer- 
ence value most nearly approximated 4.5 and 
whose frequencies were most evenly distributed 
among the eight possible ranks were chosen 
as the experimental series. This procedure was 
followed in order to rule out systematic bias 
in the selection of the experimental pictures 
and to avoid choosing pictures that would 
tend to evoke stereotyped responses. 

Administration of the pictures. Verbal and 
nonverbal responses were obtained from the 
Ss. For the verbal responses the pictures were
presented one at a time to an individual S 
who was instructed as follows: 


Whenever we see a person for the first time we 
can usually tell if we like him. Just from our first 
glance at the person we get some idea about what 
kind of person he is. Here are the pictures of some 
people—I'd like to see how they impress you when 
you see them for the first time. I don’t know who 
these people are or anything about them. I’d just 
like to get your impression of them, see what you 
think of them. When I hand you a photograph, I 
want to know everything that the picture tells you
about that person.


For the nonverbal responses or preference 
ratings, the 16 pictures were first divided into 
groups of eight. The Ss were required to rank 
each group in order of preference. (This was 
termed Preference Test 1.) Then, from all 
16 pictures, the Ss were requested to choose 
the picture liked best of all, the one liked 
next best, the one liked least of all, and the 
one liked next least. (This was called Prefer- 
ence Test 2.) 

Judges’ evaluation of the verbalized response.
A pilot study with different pictures
and Ss served as a basis for formulating a 
seven-point scale for evaluating the favorable- 
ness of feelings or attitudes expressed in the 
verbal responses. Feelings or attitudes were
rated +1 to +3 if favorable toward the picture,
−1 to −3 if unfavorable, 0 if neutral.

In addition to the scale ratings, responses 
were scored A if they were indicative of am- 
bivalence. In this connection, an ambivalent 
response was defined as one that was inter- 
nally inconsistent; that is, a response in which 
the S expressed contradicting favorable and 
unfavorable feelings toward the picture. Fi- 
nally, on the basis of the over-all protocols 
of verbal responses, the Ss were classified as 
schizophrenic or normal.

Nine members of the professional staff of 
the hospital served as judges in evaluating 
the verbal responses. For the judging proce- 
dure the 60 protocols were divided into three 
sets, each consisting of equal subsets of schizo- 
phrenics and normals. A set was evaluated 
independently by a psychologist, a psychiatrist,
and a social worker. The arrangement of
protocols and judges was random, except for
the restriction that each diagnostic group and
each profession of judges be represented
equally in the three replications.


Analysis and Results 


Test of Hypothesis 1. The judges’ ratings 
of the verbal responses of the two groups were 
compared by analysis of variance. The unit 
variate was the total of a single judge’s rat- 
ings of the verbal responses given to a picture 
by a subset of 10 Ss. A probit diagram indi- 
cated that these data approximated normal 
form. The analysis of the judges’ ratings is 
presented in Table 1. 

The highly significant variance due to the 
difference between the mean ratings of the 
schizophrenic and normal groups supported
Hypothesis 1. That is, the schizophrenics ex- 
pressed significantly more unfavorable feel- 
ings toward the pictures than did the normals. 

Several other terms in the analysis deserve 
comment. The first term in the analysis 
showed that in terms of the judges’ ratings 
there was significant variability in the re- 
sponses to the different pictures. The variance 
due to interaction of diagnosis and pictures 
showed that some pictures differentiated more 
effectively than others. The interaction term
“Schizophrenics vs. Normals × Professions of
Judges × Pictures” indicated that the experimental
instrument was a reliable test, insofar
as reliability was indicated by the fact that 
the three professions differentiated consist- 
ently between the responses of the schizo- 
phrenic and normal Ss. 

The variance due to the difference between 
schizophrenic and normal was highly signifi- 
cant when compared with any of the possible 
error variances, except that due to the interaction
of “Schizophrenics vs. Normals ×
Sets.” The significance of the latter term and
its magnitude relative to the main effect indi- 
cated that in other samples of similarly se- 
lected sets of Ss there may be some sets in 
which schizophrenics and normals would not 
be differentiated. A separate analysis of vari- 
ance for each of the three sets showed that 
the two groups were not differentiated in Set 
1 of the present experiment. However, the 
variance due to judges was significant in Set 
1, but not significant in Set 2 or Set 3. An 
examination of the total judging performance 
of the individual judges gave evidence that 
a large part of the difficulty was due to the 
poor performance of one particular judge. In 
view of this, the possible limitation which the 
above interaction variance placed on the gen- 
eralizations to be made with respect to the 
populations was not considered serious. 

A separate analysis of variance was com- 
puted for each of the six subsets of Ss. The 
unit variate was the sum of the ratings given 
by one judge to an S’s 16 verbalized responses. 
This analysis showed that for each subset the 
variance due to differences among Ss was
highly significant, indicating that the experimental
instrument differentiated not only between
the two diagnostic groups but also
among individuals within these groups. This
analysis also showed that the error (judges
× Ss) variances for the 6 subsets were homogeneous
by the Fmax test.

Test of Hypothesis 2. The frequencies of A 
(ambivalent) responses for the schizophrenic 
and normal Ss were compared by analysis of 
variance. The unit variate was the total fre- 
quency of As assigned by a single judge to 
all verbal responses given by a subset of 10 
Ss. The variates were changed to percentages 
and then converted to angles by means of the 
inverse sine transformation (Bliss & Calhoun, 
1954). The observed error variance was consistent
with that expected from the binomial
distribution. The results of the analysis are
presented in Table 2.

Table 1

Analysis of Variance of the Judges’ Ratings of the Verbalized Perceptual Responses

Source of Variance

Pictures
Sets
Pictures × Sets
Professions (of Judges)
Professions × Sets
Professions × Pictures
Professions × Sets × Pictures
SRPs vs. Ns*
SRPs vs. Ns × Pict
SRPs vs. Ns × Sets
SRPs vs. Ns × Pro
SRPs vs. Ns × Sets × Pro
SRPs vs. Ns × Pict × Sets
SRPs vs. Ns × Pro × Pict
SRPs vs. Ns × Pro × Pict × Sets
Total
Correction for mean
Initial unit variate

* Here and in the succeeding tables SRPs and Ns are used a[...]tively
b In the last section of the analysis the following abbreviat[...]
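
The inverse sine (angular) transformation used in the test of Hypothesis 2 can be sketched as follows. This is only an illustration, not the authors' computation: the function names and the example count of 40 are hypothetical, and the angles are expressed in degrees on the assumption that tabled values of that form were used. The n of 160 follows from the unit variate described in the text (10 Ss each responding to 16 pictures). For a proportion based on n observations, the binomial variance of the transformed angle is approximately 821/n in squared degrees, which is the kind of theoretical value against which an observed error variance can be compared.

```python
import math


def arcsine_angle(count, n):
    """Return the angle (degrees) whose sine is sqrt(count / n)."""
    p = count / n
    return math.degrees(math.asin(math.sqrt(p)))


def binomial_angle_variance(n):
    """Approximate binomial variance of the angle, in squared degrees."""
    return (180.0 / math.pi) ** 2 / (4.0 * n)


# Hypothetical count: 40 of the 160 responses in one unit scored A.
angle = arcsine_angle(40, 160)
expected_variance = binomial_angle_variance(160)
print(round(angle, 2), round(expected_variance, 3))
```

The transformation is used because the variance of a raw percentage depends on the percentage itself, whereas the variance of the angle depends only on n.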

The variance due to the difference between 
group means was significant at the .05 level. 
It was concluded that as compared with the 
normals, the schizophrenics gave significantly
more responses that were rated ambivalent or
internally inconsistent.

Table 2

Analysis of Variance of the Ratings of Internal
Inconsistency or Ambivalence (A)

Source df Variance

Sets 120.988
Professions 497.881 10.566
Sets × Professions
SRPs vs. Ns
SRPs vs. Ns × Sets
SRPs vs. Ns × Professions
SRPs vs. Ns × Sets × Professions
Total
Correction
Initial variate

Tabled entries (Variance, P <) detached from their rows in the source:
324.937, 67.542, 38.990, 98.792, 93.583, 200.500, 302.167, 31.792, 28.365;
P < .001, .001, .01, .01, .001; 14 559, 35 43, 7 671, 5.364.

Test of Hypothesis 3. As noted earlier, each
protocol was assessed independently by a psychologist,
a psychiatrist, and a social worker.
It was found that at least two members of
a team correctly diagnosed 24 of the 30
schizophrenics and 22 of the 30 normals. Altogether,
68% of the diagnostic judgments
were correct. From a χ² analysis of the data,
it was found that judges utilizing the protocols
of verbalized responses as diagnostic information
were able to differentiate a significantly
greater number of schizophrenics and normals
than would be expected by chance. The poor
performance of a single judge, as noted above, 
contributed heavily to the disappointingly low 
percentage of accurate diagnostic judgments. 

Test of Hypothesis 4. The ordinal data
derived from the two preference tests were
transformed to rankits for analysis of variance.
Both of the preference tests failed to
differentiate the two groups. However, the
variances due to pictures were again highly
significant.

Table 3

Analysis of Variance of the Consistency of Performance
of SRPs and Ns on the Two Picture Preference Tests

Source of Variance    df    SRP Variance    N Variance    F
Pictures              15    23.0014         26.3839
Pictures × test       15    3.6935          0.9361        3.946
Total                 30    27.2949         27.3200

* The F rat[...] level for both gr[...]
** The P va[...] difference betweer[...] of the differer[...]

In administering the two preference tests 
to the schizophrenics it was noted that the 
relative preference for a given picture in Test 
2 was often considerably different from that 
on Test 1. In one extreme case, a picture 
ranked first on Test 1 was chosen as least 
liked on Test 2. A graphic analysis of the raw 
data and a comparison of the pictures’ rela- 
tive ranks on the two tests suggested that the 
consistency of performance from test to test 
was different for the two groups. In order to 
test this statistically, the data from Preference 
Test 1 and that from Preference Test 2 were 
reduced to comparable unit variates, and the 
two groups were compared with respect to 
consistency of test-to-test performance by 


analysis of variance. The analysis is presented 
in Table 3. 
Comparison of the “Picture × Test” variances
showed that of the two groups the
schizophrenics were significantly less consistent
in their performance on the two preference
tests.
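
The comparison of the two “Picture × Test” variances can be illustrated with a short sketch. It assumes, as the Table 3 entries suggest, that the tabled ratio is simply the larger of the two interaction variances divided by the smaller, each on 15 df; the function name is hypothetical, and the sketch reproduces only the ratio, not the authors' significance test.

```python
def variance_ratio(var_a, var_b):
    """Return the larger variance divided by the smaller one."""
    larger, smaller = max(var_a, var_b), min(var_a, var_b)
    return larger / smaller


# Pictures x test variances as they appear in Table 3.
srp_pictures_x_test = 3.6935   # schizophrenic group (SRPs)
n_pictures_x_test = 0.9361     # normal group (Ns)

f_ratio = variance_ratio(srp_pictures_x_test, n_pictures_x_test)
print(round(f_ratio, 3))
```

A larger interaction variance means that a picture's relative standing shifted more between Preference Test 1 and Preference Test 2, which is the sense in which the schizophrenic group was less consistent.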


Discussion 


In this study the stimulus situation, includ- 
ing the oral instructions to the Ss, was de- 
signed to evoke responses of somewhat differ- 
ent nature than those typically called for in 
projective testing. The Ss were not asked for 
an imaginative production or encouraged to 
let themselves go in a world of fantasy. 
Rather, they were asked for their immedi- 
ate perceptions of the photographs—their 
thoughts and feelings about the people pictured.
It is reasonable to assume that responses
so obtained are less characterized by
the mechanisms of projection and identifica- 
tion than typical projective data. The orally 
reported perceptions of human photographs 
in the present study interpreted as a conven- 
tional projective response would simply lead 
to the conclusion that paranoid schizophrenics 
are generally tense, suspicious, hostile, and 
threatening people. Considering them more 
as perceptual reports, these responses might 
be interpreted as indicating that paranoid 
schizophrenics are strongly inclined to per- 
ceive other people as tense, suspicious, hostile, 
and threatening. 

The ambivalent verbal responses and the 
picture preference data pointed to a lack of 
consistency in the performance of the paranoid
schizophrenic Ss. However, the terms
“ambivalent” and “inconsistent” do not adequately
describe these Ss’ test performance or
the day-to-day behavior which might be in- 
ferred from it. The data suggest that the 
paranoid schizophrenic is more effectively de- 
scribed as characterized by sudden shifts of 
feelings and attitudes. These rapidly changing 
attitudes, feelings, or needs would naturally 
be concomitant with highly variable behavior. 
This concept of highly variable behavior lends 
itself more readily to operational definition 
and experimental investigation than does the
more elusive concept of ambivalence.


Summary 


Essentially, this study was designed to examine
the efficacy of photographs of normal
human faces as a diagnostic device and as a
method of studying personality. The data
obtained made possible an evaluation of the
orally reported perceptual responses and preference
ratings of 30 paranoid schizophrenic
Ss in terms of comparable behavior of 30
normal Ss.

Oral reports of the Ss’ perceptions of the 
16 photographs were obtained. Following this, 
two different techniques were utilized to obtain
the Ss’ preference ratings of the experimental
pictures. Judges evaluated the Ss’
verbal responses with respect to favorableness
of feelings expressed and internal inconsistency.

The experimental instrument differentiated
the paranoid schizophrenic and normal Ss in
terms of: (a) favorableness of feelings expressed,
(b) ambivalence of verbalized response,
(c) consistency of preference ratings,
and (d) diagnostic significance of the over-all 
protocol. The concept of highly variable needs 
and feelings leading to highly variable per- 
ceptions and behavior was suggested as a 
more useful concept than that of ambivalence, 
particularly in the study of paranoid schizo- 
phrenic personalities. 


Received February 28, 1958.

In many outpatient mental hygiene clinics,
such as the one involved in this study, almost 
all patients who apply for treatment are 
eventually accepted. Unfortunately, there is 
included in this group an alarmingly large 
proportion of individuals who go through a 
complex intake procedure only to abruptly 
terminate individual psychotherapy after two 
or three contacts. If there were a way of iden- 
tifying these abrupt terminators before the 
institution of individual therapy procedures, 
some alternative treatment approaches could 
be attempted. This paper describes a method 
of identification of abrupt terminators and a 
cross-validation of this method. The method 
is based on the Rorschach test. 

While attempts have been made to predict 
continuance of individual psychotherapy in 
outpatient settings from Rorschach perform- 
ance (Auld & Eron, 1953; Gibby, Stotsky, 
Hiler, & Miller, 1954; Kotkov & Meadow, 
1953; Rogers, Knauss, & Hammond, 1951), 
several problems need consideration in addi- 
tion to further efforts at validation in various 
clinic settings. Meehl and Rosen (1955) have 
indicated that the usefulness of such pre- 
dictive instruments must take into account 
the ways in which such instruments can be 
meaningfully employed in the clinic setting. 
In addition, they have indicated that the base 
rates of termination must be considered, both 
in the selection of criterion groups and in the 


1 Now at the Nebraska Psychiatric Institute.
2 At the VA West Side Hospital, Chicago, Illinois,
at the time of inception of this study.

estimation of the possible actual efficiency of 
the predictive instrument. Before an inde- 
pendent attempt was made to develop a pre- 
dictive instrument from the Rorschach test, 
an evaluation of the usefulness of available 
predictive instruments was made, taking into 
account the base rates for continuance in indi- 
vidual psychotherapy in the Chicago clinic. 
Kotkov and Meadow (1953) developed a dis- 
criminant function using R, FC-CF, and D%, 
which was able to predict 81% of the actual 
terminators before the ninth interview and 
58% of the remainers after the ninth inter- 
view. As the actual percentage of patients 
terminating before the ninth interview in the 
Chicago clinic is 69%, the percentage of total 
correct decisions with this base rate taken into 
account would be 74%. This is only slightly 
higher than the 69% accuracy which would 
be achieved by calling every incoming patient 
a terminator. Gibby, Stotsky, Hiler, and 
Miller (1954) developed a discriminant func- 
tion equation using R, K, and m, which when 
cross-validated was able to predict 87% of 
those that terminated before the twentieth 
interview and 37% of the remainers after the 
twentieth interview. Applying this formula to 
the Chicago clinic, where 85% of the patients 
terminate before the twentieth interview, 
yields a percentage of accuracy of total cor- 
rect decisions of 79%. This is lower than the 
accuracy achieved by predicting that every 
patient would be a terminator, i.e., 85%. 
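
The base-rate arithmetic in the preceding paragraph can be checked with a short sketch. The function and variable names are hypothetical; the calculation simply weights each hit rate by the proportion of patients in that group, which is the reasoning the paragraph attributes to Meehl and Rosen (1955).

```python
def overall_accuracy(base_rate, hit_rate_terminators, hit_rate_remainers):
    """Proportion of all decisions that are correct at a given base rate.

    base_rate: proportion of patients who actually terminate.
    hit_rate_terminators: proportion of terminators correctly identified.
    hit_rate_remainers: proportion of remainers correctly identified.
    """
    return (base_rate * hit_rate_terminators
            + (1.0 - base_rate) * hit_rate_remainers)


# Kotkov and Meadow (1953): ninth-interview criterion, base rate 69%.
kotkov_meadow = overall_accuracy(0.69, 0.81, 0.58)

# Gibby, Stotsky, Hiler, and Miller (1954): twentieth-interview
# criterion, base rate 85%.
gibby_et_al = overall_accuracy(0.85, 0.87, 0.37)

print(round(kotkov_meadow, 3), round(gibby_et_al, 3))
```

The first value comes to about .74, against .69 for calling every patient a terminator; the second comes to roughly .795, which the text gives as 79% and which falls below the .85 obtained by calling every patient a terminator.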

In a survey of attrition in the Chicago 
clinic over a seven-year period, it was found
that approximately 50% of the patients accepted
for psychotherapy drop out before the
fourth interview. As pointed out by Meehl 
and Rosen, this proportional distribution of a 
dichotomous attribute is most easily improved 
upon by predictive instruments. 

The purpose of this investigation was to 
arrive at and cross-validate an instrument 
which could identify a group of patients who 
would terminate individual psychotherapy 
abruptly, in this case, before the fourth inter- 
view. 

Procedure 


The Rorschach records initially used in 
this study consisted of the test performances 
of 75 male veterans, service-connected for a 
psychiatric disorder, who had requested and 
been accepted for individual psychotherapy. 
Patients with a service-connected diagnosis of 
psychosis or brain injury were not included. 
The cases were selected from the clinic rec- 
ords and covered a seven-year period. The 
records had all been scored by examiners of 
at least one full year of clinical diagnostic 
experience using Beck’s system of scoring. 
The following Rorschach categories were 
tabulated: R, W, D, Dd, C, CF, FC, M, H, 
Hd, A, Ad, Y, YF, FY, F%, F+%, A%, 
P, and number of content categories. 

Utilizing the results of these tabulations, 
a discriminant equation (Hoel, 1947) was 
developed, and an appropriate cutting score 
was selected to isolate a group of abrupt ter- 
minators, those who drop out before the 
fourth interview. As pointed out above, the 
base rates are such that they would not im- 
pair the efficiency of the predictive equation. 

The discriminant equation was cross-val- 
idated on a second independent sample of 
100 patients, 50 remainers and 50 terminators 
selected according to the same criteria as 
governed the choice of the initial sample. 


Results 


Table 1

Rorschach Performance as a Function of Number of
Therapy Interviews Attended

Columns: Rorschach Category; Total Number of Therapy
Interviews Attended (last group: 20 & over, N = 25); Mean.

Legible entries, not assignable to rows or columns in the source:
18.72, 5.56, 6.80, 11.44, 14.00, 1.64, 1.92, 41.92, 9 40, 22.20,
10.20, 79, 22.79, 2.68, 1.60, 8.40, 3.20, 3.08, 3.00, 10.76, 3.96,
12.12, 6.40, 74*, 56, 41.

Table 1 presents the means and medians of
each of the Rorschach categories investigated
in the original 75 records. As can be seen,
R tends to increase as time in treatment increases.
An analysis of variance showed that the
differences in R among the three groups were
highly significant (F = 5.76; 2, 72 df; p < .01).
Since all other
Rorschach categories are in varying degrees
dependent on R, it was decided to match sub- 
groups for R so that each subgroup consisted 
of 15 test records. The cases matched covered 
a range of 8-40 responses; the mean number 
of responses for each of the groups was 23.5. 
Statistical comparison of the matched groups 
on the other Rorschach categories was done 
using the analysis of variance by ranks technique
(Moses, 1952). The only significant
difference was in M (χ²r = 6.43, 2 df, p <
.05). The other category which approached
significance, when the number of responses
was controlled, was H (χ²r = 3.70, 2 df).

Using the discriminant function technique
(Hoel, 1947), it was decided to develop a
predictive equation using R, M, and H as
variables. Solving for these variables, using
the unmatched data from the 0-3 and the
20 and over groups, the following equation
was found: Continuance (Co) = R + 19.8M −
18.6H. Continuance scores were computed,
and a cutting point of 11.5 was established;
those scoring less than 11.5 were identified
as terminators.

The equation was first evaluated by comparing
a group of 50 terminators with a group
of 50 remainers. This was done by adding 25
terminators to the original sample of 75 cases.
This sample of 100 cases is called Sample I.
Table 2 presents a contingency table relating
continuance score and number of therapy interviews
for Sample I.

Table 2

The Continuance Score as a Function of
Therapy Attendance in Sample I

Number of Therapy Interviews Attended
Continuance Score    0-3    4 or More
<11                  20
>11

Note: numbers of individuals.

The difference between these frequencies is
significant (χ² = 10.45, 1 df, p < .01).

The cross-validation was performed with 
an independent sample (Sample II) of 50 
terminators and 50 remainers who met the 
same criteria as governed the selection of 
Sample I. The results are presented in Table 
3. The difference between the frequencies is 
significant (χ² = 6.73, 1 df, p < .01).

Table 3

The Continuance Score as a Function of
Therapy Attendance in Sample II

Number of Therapy Interviews Attended
Continuance score    4 or More
<11                  9
>11                  41

Note: numbers of individuals.

It should be noted that for Sample II, taking
base rates into account, the over-all percentage
of correct decisions would be 63%,
an improvement of 13% over the 50% accuracy
of calling every patient a terminator.
However, the purpose to which the equation
will be put must also be considered in evaluating
its potential usefulness. We were only
interested in making a positive diagnosis of 
termination. As Meehl and Rosen (1955) 
have indicated, the probability of a positive 
diagnosis being correct, if base rates are 
equal, is the ratio of the valid positive rate 
to the sum of the valid positive rate and the 
false positive rate. Applying this to Sample 
II, the probability of correctly predicting a 
terminator is .71. 
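
The Sample II figures can be reproduced with a short sketch. Two things here are assumptions rather than statements from the article: the count of 22 terminators scoring below the cutting point is derived from the Summary's statement that the instrument isolated 44% of the 50 actual terminators, and the chi square is computed with a continuity (Yates) correction because that choice reproduces the reported value. The function names are hypothetical.

```python
def positive_predictive_value(valid_positive_rate, false_positive_rate):
    """Ratio described in the text for the case of equal base rates."""
    return valid_positive_rate / (valid_positive_rate + false_positive_rate)


def yates_chi_square(a, b, c, d):
    """Chi square with continuity correction for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


terminators_below = 22                      # assumption: 44% of 50 terminators
terminators_above = 50 - terminators_below  # 28
remainers_below = 9                         # Table 3, 4 or more interviews
remainers_above = 41                        # Table 3, 4 or more interviews

ppv = positive_predictive_value(terminators_below / 50, remainers_below / 50)
accuracy = (terminators_below + remainers_above) / 100
chi_square = yates_chi_square(terminators_below, terminators_above,
                              remainers_below, remainers_above)
print(round(ppv, 2), round(accuracy, 2), round(chi_square, 2))
```

Under these assumptions the sketch returns the .71 probability of a correct positive diagnosis, the 63% over-all accuracy, and the chi square of 6.73 given for Sample II.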

Auld and Eron (1953) have presented evi- 
dence which suggests that intelligence meas- 
ures are as effective as Rorschach variables 
in the prediction of continuation in psycho- 
therapy. An evaluation of this possibility was 
made in the context of this study.
Wechsler-Bellevue Form I Verbal IQ scores were available
on 24 patients in the study who terminated
before the fourth interview and on
38 patients who terminated after the fourth
interview. The mean IQ of the 0-3 group was
101.4, and the mean IQ of the 4 and over
group was 107.44. While the difference between
these two groups did not reach significance
(t = 1.58, 60 df, p = .06), an inspection
of the distribution of IQ scores in each
group suggested that IQ could be a useful
predictor if one wished to isolate a group of
patients who would remain in treatment beyond
the third interview. In our sample, 32%
of the stayers had IQs above 115, whereas
only 8% of the terminators scored above 115.
In the prediction of the abrupt terminator,
however, use of IQ alone is not effective;
e.g., identifying patients scoring below 95 as
terminators selects 29% of the actual terminators,
but also includes 21% of the actual
remainers.


Discussion 


The predictive instrument developed in this 
study isolates as abrupt terminators a group 
of patients with fewer than 11 Rorschach 
responses and no M and/or a group which 
tends to give H responses that are not scorable 
as M. A clinical interpretation of the Ror- 
schach records of this group of patients would 
indicate that they are persons who are characterized
by a limited verbal productivity and
a disinclination to spontaneously express ideas 
which deal with human activity. In view of 
the degree to which abilities to verbalize and
maintain a discussion of interpersonal events
are major expectations of therapists, psycho- 
therapy may place an initial demand on these 
patients which they are ill-equipped to meet. 
While the underlying basis for these deficien- 
cies may vary, e.g., impoverished background, 
low intellectual capacity, marked anxiety, ex- 
treme inhibition, etc., the immediate problem 
they present could be approached directly. 

As an initial step in investigating ways of 
remedying the degree to which the patient 
might feel at a disadvantage in the initial 
stages of treatment, the following technique 
is suggested. These patients could be assigned 
to pretherapy groups oriented around discussion
of the topics of personality processes,
symptom development, and the purpose of
psychotherapy. The aim of the groups would
be to supply the verbally inhibited patient 
with added resources for entering into an 
interaction with the therapist. The effective- 
ness of this procedure or others designed to 
facilitate entrance into individual psycho- 
therapy could be evaluated against the pre- 
dictions of the formula. 


Summary 


An instrument utilizing R, M, and H from
the Rorschach test was developed for the purpose
of predicting the abrupt terminator in
individual psychotherapy. The probability of
making a correct positive diagnosis of termination
was .71, and the instrument isolated
44% of the actual terminators. The abrupt
terminator isolated in this study is characterized
by limited verbal productivity and avoidance
of expression of ideas dealing with
human activity. Investigation of alternative 
approaches to these patients is in order, rather 
than continuing to assign them to a situation 
that they routinely avoid.

Human figure drawings as diagnostic tools
are finding increased use and acceptance 
among psychologists. The purpose of this 
study was to determine whether or not human 
figure drawings by physically disabled per- 
sons could be differentiated from human fig- 
ure drawings by physically normal persons, 
and to investigate some of the factors involved 
in the differentiation. 

The need for the investigation arose from 
the observation that previous studies yielded 
contradictory results. Abel (1953), Tolor 
(1955) and Tolor & Tolor (1955) indicated 
that figure drawings were valid projections of 
the drawer, while studies by Simms (1951), 
and Silverstein and Robinson (1956) did not 
support this finding. Studies by Albee and 
Hamlin (1949, 1950) showed that judges 
could reliably judge figure drawings, although 
Tolor and Tolor (1955) and Silverstein and 
Robinson (1956) concluded they could not. 
In most of these studies, however, certain con- 
ditions existed in the design that would sug- 
gest that important variables which could 
have influenced the results may not have been 
adequately controlled. 

Among these sources of variability was the 
wide variation in methods used by the investi- 
gators. Blanchard (1952) and Tolor (1955) 
tried global judgments, Lehner and Gunder- 
son (1952) used signs and check lists, Abel 
(1953) employed categorization, while Simms 
(1951) and Silverstein and Robinson (1956) 
applied more than one of the above. 

Another source of variation was suspected 
to lie in the frequent use of children as sub- 
jects, since in studies by Goodenough and 
Harris (1950) and Machover (1953b) pos- 
sible developmental factors, both in motor 


1 Now at State College, St. Cloud, Minnesota 


and personal adjustment areas, were fre- 
quently found in children’s drawings. In addi- 
tion, Tolor (1955), Simms (1951), and Sil- 
verstein and Robinson (1956) used children 
in their studies and their results were mostly 
negative. 

Still a third source, and one that this study 
attempts to evaluate, might have been in the 
usual practice of selecting judges for the 
studies solely on the basis of some academic
and/or experience factor. Simms concluded 
that psychologically experienced judges who 
knew the techniques of figure drawing analy- 
sis did no better in judging the drawings than 
naive and unsophisticated judges. Albee and 
Hamlin (1950) concluded that nonpsycholo- 
gists not experienced in projective techniques 
could make judgments of the figure drawings 
as reliably as the clinical psychologists used 
in the study. This would seem to indicate that 
some other factors were operating that might 
be only indirectly related to experience and 
knowledge of drawing analysis.

Two studies in the counseling area serve as
examples of such a phenomenon. Dipboye
(1952) discovered a tendency for “counselor
style” to divide into “affective” and “cognitive”
areas and for counselors to respond
differently to each area. McGowan (1955)
found further that counselors could be
grouped according to whether their interview
responses primarily concerned the “content”
or “affect” of the client-counselor relationship
and that counselor “style” is not so much
attributable to professional training as to the
counselor’s own personality and his readiness
to use or not use any specific counseling
technique.

As related to the use of figure drawings,
these studies would suggest that judges might
evaluate figure drawings more in the light of
their professional orientation or leanings than 
in the light of their formal training and ex- 
perience. This would imply a potential vari- 
ability in successfulness, particularly, if one 
orientation is more conducive to effective or 
reliable drawing analysis. In the present 
framework, one orientation might be consid- 
ered “affective” and involve an impressionistic
or “feeling” approach to the drawings, and
the other orientation might be regarded as 
actuarial or “cognitive,” with a tendency to 
evaluate the drawings more in terms of spe- 
cific signs or factors. 

When these possible sources of variation in 
the studies on figure drawings are considered, 
the contradictions found seem more under- 
standable, and the present study represents 
an attempt to control for them in the follow- 
ing manner. First, the design of the study 
makes use of the short inspection type of 
drawing analysis commonly used in their 
everyday application. Second, the subjects for 
the study were adults, and third, a prelimi- 
nary attempt was made to select and separate 
judges not only on the basis of experience and 
familiarity with figure drawings but, also, on 
their professional or psychological orientation. 

The hypotheses of the study are: (a) hu- 
man figure drawings by physically disabled 
persons can be differentiated from human fig- 
ure drawings by physically normal persons; 
(b) experienced psychologists with a similar
knowledge of drawing analysis differ in their 
ability to judge the drawings; (c) judges with 
psychological training and knowledge of draw- 
ing analysis can differentiate the drawings 
with greater success than untrained judges 
naive in drawing analysis. 


Subjects and Procedure 


Drawings. Human figure drawings were ob- 
tained from each of the 60 subjects (Ss) in 
the study in the manner suggested by Mach- 
over (1949), with a drawing of each sex com- 
prising a pair. All identifying data, except the 
sex and age of the drawer, were removed, and 
the pairs were randomly intermixed and numbered
from 1 to 60. The top drawing in each
pair was the one made first.

These randomly ordered pairs were presented
to each judge, and he was told to
separate them into two groups—the drawings 
by the disabled in one and those by the 
physically normal in the other. The order of 
his placements was tabulated, and each was 
scored as “right” if placed in the group to 
which its drawer actually belonged or “wrong” 
if not so placed. His rights or wrongs could 
vary from 0 to 60, since his choices were not 
forced into equal piles. Individual and group 
results were analyzed by chi square, using a 
one-tailed test of significance and .05 as the 
accepted level of confidence. 

Subjects. The disabled group consisted of 
20 males and 10 females ranging in age from 
20 to 70. The criteria for their selection
included their being at least 20 years of age,
and the presence in each of some visible
physical disability such as an amputation. An
S was not included if he also had some
diagnosed psychological disability, such as
neurosis or mental deficiency. There may have been
some incidence of these, but none were discovered
in the psychological examinations of
this group. The group was essentially of average
intelligence and from the lower-middle or
lower socioeconomic backgrounds. The Ss
were part of The Greater Kansas City Survey
and Demonstration Project for the Handicapped,
conducted by Community Studies Incorporated,
of Kansas City, Missouri.

The normal group was made up of 10 males and
20 females, aged 21 to 59. The selection
criteria consisted of being at least
20 years of age, functioning adequately in an
occupation and with no visible physical
disabilities. Factors of intelligence and
psychological adjustment were held constant by
randomly selecting people functioning adequately
in an unselected group of occupations. An
assumption was made that the incidence of
atypical factors would be no greater than in
any population of normal people. Occupations
represented include: secretary, recreation
worker, receptionist, clerical worker, and
building maintenance (not custodian). The
Ss were not objectively evaluated, but incomes,
occupations, and social characteristics
would tend to place them at a lower-middle
or upper-lower socioeconomic level.

Judges. The judges for the study were selected
with the aid of a short information
form, developed to identify each judge’s feelings
toward professional questions concerning
the use of tests, attitudes toward figure draw- 
ings, counseling style, etc. Responses to each 
item were evaluated by three doctoral candi- 
dates in counseling psychology as either tend- 
ing toward a cognitive or toward an affective 
point of view, using criteria suggested by the 
investigator. An inspection of the results of 
the evaluations of all items for each judge 
roughly indicated the general direction of his 
professional orientation. 


The three groups of judges were made up
as follows: (a) Group I was composed of
three persons with doctor’s degrees in counseling
or clinical psychology and professional
experience beyond the degree. Each had a
basic knowledge of figure drawing analysis
and some experience in its use. Their general
professional orientation was determined to be
“affective.” (b) Group II consisted of three
persons with doctor’s degrees in counseling
or clinical psychology and professional
experience beyond the degree. Each had a basic
knowledge of figure drawing analysis and
some experience in its use. Their general
professional orientation was determined to be
“cognitive.” (c) Group III was made up of
three persons with doctor’s degrees in education
and professional experience beyond the
degree. No one in this group had any knowledge
of figure drawing analysis. Their
professional orientation was unknown.


Results 


In order to evaluate possible effects on the
judge’s decisions of the uneven distribution
of males and females in the two groups of Ss,
the judgments were first analyzed by sex.
That is, contingency tables were developed
to see if any relationship existed between the
sex of the Ss and how the judges sorted their
drawings. In other words, is there a relationship
between the proportions of drawings,
called disabled and called normal, and sex
of the people drawing them? For the disabled
Ss, the chi square was .74, for the nondisabled
Ss .007, and for all the Ss together it was
3.60. For two degrees of freedom, chi squares
as great or greater than these could occur by
chance at least 70%, 98%, and 20% of the
time, respectively. This indicates no significant
relationship existing between the placements
of the drawings and the sex of the
drawer and permits one to infer that any significant
results in succeeding analyses are due
to judgments on cues related to disability,
not sex.


Table 1

Chi Square Values and Significance
Levels of Judgments

(Entries by group and judge for Groups I, II,
and III and for the groups together.)


Contingency tables between all three groups
of judges and between every combination of
two, resulted in probabilities that were in no
case less than .30 (see Table 1). This signifies
that null hypotheses of no difference in
judging ability between groups could not be
rejected (Hypotheses b and c). That is, it is
not clear from this study whether cognitive,
affective, or naive judges, as a group, are
superior in judging figure drawings.
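
The between-group comparison described above can be illustrated with a short sketch. It is not the authors' computation: the counts are hypothetical (three judges per group, each sorting 60 pairs, gives 180 judgments per group), and the function names are introduced here only for illustration. A 3 x 2 table of groups by right or wrong placements has two degrees of freedom, for which the upper-tail probability of chi square has the closed form exp(-chi2 / 2).

```python
import math


def chi_square_independence(table):
    """Pearson chi square for an r x c table of counts (a list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


def p_value_df2(chi2):
    """Upper-tail probability of chi square with exactly 2 degrees of freedom."""
    return math.exp(-chi2 / 2.0)


# Hypothetical counts, NOT the study's data.
# Rows: judge Groups I, II, III. Columns: right, wrong placements.
hypothetical_counts = [[105, 75], [98, 82], [104, 76]]

chi2, df = chi_square_independence(hypothetical_counts)
print("chi square:", round(chi2, 3), "df:", df, "p:", round(p_value_df2(chi2), 3))
```

With counts this similar across groups the probability is large, which is the pattern the text reports when it says no probability fell below .30.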

The judgments of each group were tested 
by chi square against a chance expectancy 
of .5, since judgments were between just two 
categories. The results found in Table 1 indicate
Groups I and III were significantly better
than chance in judging the drawings.
These results, and the p of .01 for the total
judgments, permit a rejection of the null
hypothesis of no differentiability between the
two groups of drawings (Hypothesis a).
Drawings by physically disabled persons can 
be differentiated from those by physically 
normal persons under the conditions of this 
study. 
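
The test against a chance expectancy of .5 can be sketched as follows. This is a minimal illustration under stated assumptions, not a reproduction of the authors' tables: the score of 40 rights out of 60 is hypothetical, no continuity correction is applied, and the function name is introduced here. With two categories the chi square has one degree of freedom, so its upper-tail probability equals erfc(sqrt(chi2 / 2)); the one-tailed value is half of that when the judge is above chance.

```python
import math


def chi_square_vs_chance(rights, total):
    """Chi square (1 df) of rights vs. wrongs against a 50/50 expectancy.

    Returns the chi square and a one-tailed probability in the direction
    of better-than-chance sorting. No continuity correction is applied.
    """
    expected = total / 2.0
    wrongs = total - rights
    chi2 = ((rights - expected) ** 2 + (wrongs - expected) ** 2) / expected
    # Two-tailed probability for chi square with 1 degree of freedom.
    two_tailed = math.erfc(math.sqrt(chi2 / 2.0))
    if rights >= expected:
        one_tailed = two_tailed / 2.0
    else:
        one_tailed = 1.0 - two_tailed / 2.0
    return chi2, one_tailed


# Hypothetical judge: 40 of 60 pairs placed correctly (not a value from the study).
chi2, p_one_tailed = chi_square_vs_chance(40, 60)
print("chi square:", round(chi2, 3), "one-tailed p:", round(p_one_tailed, 4))
print("significant at .05:", p_one_tailed < 0.05)
```

The one-tailed form matches the procedure stated earlier, where only better-than-chance sorting counts as evidence for the hypothesis.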


Discussion and Conclusions 


The results of the study indicate there is 
no significant difference between the groups 
of judges involved, in their ability to differen- 
tiate human figure drawings. This may point 
to a worthwhile area for investigation, for 
although the number of judges per group in 
this study is small and the results on this 
point not decisive, the intimation is that the 
untrained judges did as well as the experi- 
enced judges in differentiating the drawings. 

Another outstanding point in the results 
is the wide variability between individual 
judges. Probabilities that the scores obtained 
were due to chance, ranged from as low as 
.005 to as high as .45. One conclusion that 
might be drawn from this is that facility with 
figure drawing analysis is in the main de- 
pendent upon some factor within the indi- 
vidual using the drawings and not on his 
particular knowledge of the theory of drawing 
analysis. 


In conclusion, the study suggests that figure
drawings really are projections of the
drawer, but not merely as simple projections
of self-image. More as Machover (1953a,
p. 262) says:


It is clear from the study of drawings of handicapped
persons that the relation between body
handicap and projection in the drawings is not a
simple one. In the attitude toward a handicap there
is the mediation of the whole personality. Drawings,
as sensitive instruments recording realistic or shining
self-evaluations, must be analyzed in the light of
the whole personality.


It also implies that figure drawing tech- 
niques have possible value in clinical and 
counseling situations, but the wide variability 
among the judges indicates that the effective- 
ness and applicability of the techniques may 
be a function of the individual using them. 
The usefulness of the drawings will probably 
be complemented by other evaluation proce- 
dures and methods, and by background 
knowledge and understanding of the indi- 
vidual drawing the figures. 


Lyle D. Schmidt and John F. McGowan


Finally, although the naive judges were 
quite accurate in their judgments, this should 
not be interpreted as a license for untrained 
persons to use figure drawings. Instead, many 
clinicians point out that since the drawings 
may contain considerable information about 
the drawer they must be handled profession- 
ally by competent persons until their clinical 
meanings can be more conclusively ascer- 
tained. 


Summary 


Human figure drawings were obtained from 
30 persons with visible physical disabilities 
and 30 persons without visible physical dis- 
abilities. They were randomized and presented 
to each of the members of three selected 
groups of judges who were requested to sort 
the drawings into “disabled” and “normal” 
piles. Group I judges were experienced in 
drawing analysis and were determined to have 
generally an “affective” professional orienta- 
tion. Group II judges were experienced in 
drawing analysis but had a general “cognitive”
professional orientation. Group III
judges had no experience in figure drawing
analysis.

From a chi square analysis it was found 
that 


1. Figure drawings by physically disabled
persons could be distinguished from figure
drawings by physically normal persons, under
the conditions of this study.

2. There was no demonstrable superiority
of any one group in the study in regard to
drawing judgment, indicating that the group
with no training in drawing analysis was able
to sort the drawings with as great success as
either of the experienced groups.

3. A wide variation existed between individual
judges in the success of their drawing
judgments. Probabilities that the discriminations
were due to chance varied from as low
as .005 for some judges to .45 or more for
others.


CHLORPROMAZINE AND APPARENT MOTION
PERCEPTION BY SCHIZOPHRENICS


RAYFORD T. SAUCER¹


Veterans Administration Hospital, Perry Point, Maryland


If the perception of apparent movement
is considered as a complex synthesis of the
spatial and temporal aspects of the stimulus
with apparent movement as the emergent
(Saucer, 1954), a loss or deficit in the ability
to see apparent movement may reflect both a
specific and a general loss of ability to organize
and to synthesize perceptual processes.

It is commonly felt that schizophrenics
show a deficit in their ability to organize and
to synthesize. Experimental results indicate
(Saucer & Deabler, 1956) that schizophrenics
do experience difficulty in perceiving apparent
motion, but that lobotomized schizophrenics
do not show such a deficit. Since the earlier
results were suggestive rather than definitive,
a further study with schizophrenics using a
larger N was carried out. It was found that
the threshold for dissociation for apparent
movement in schizophrenia was significantly
lower than for normals (Saucer, 1958).

Clinical observation indicates that chemotherapies,
such as chlorpromazine, may produce
profound changes in overt behavioral
patterns. It is the contention of many clinical
observers that the effects of chlorpromazine
are similar to the effects of lobotomy. Porteus
(1957) reports changes on maze performance
parallel to those established for lobotomized
patients. If there are also changes in perceptual
processes as a result of chlorpromazine
medication which parallel changes in lobotomy,
our understanding of the mechanisms
whereby such agents affect behavior may be
enhanced.

It is the specific hypothesis of this experiment
that chlorpromazine will enhance the
perception of apparent movement, moving the
schizophrenic toward the normal and toward
lobotomized schizophrenics in this perceptual
area.


¹ Now with the Research Unit on Aging, Veterans
Administration Center, Kecoughtan, Virginia.


Methods and Apparatus 


Subjects (Ss) were 20 undifferentiated,
catatonic, and hebephrenic schizophrenics
from the neuropsychiatric wards of the Perry
Point Veterans Administration Hospital and
18 individuals from the same wards who were
(a) currently receiving chlorpromazine and
had been treated for at least one week or
(b) who had been treated by chlorpromazine
medication and had been removed from treatment
not less than one month prior to testing.
(Some patients were removed from medication
after the experiment had been planned
but before it could be carried out.)

In a previous study (Saucer, 1958) six of
the Ss tested were found to be receiving
intensive chlorpromazine medication. Since the
present experiment was in planning, data from
these Ss were withheld and combined with
data from 32 additional Ss chosen by
the psychiatric staff. Insofar as can be determined,
no systematic bias affected the choice
of Ss, nor was the experimenter aware of the
treatment mode during the experimental procedure.

Each patient was brought into a room where
he was told that the examination was special testing
[...] the ward psychiatrist. He was asked
[...] three minutes in a dark room, during
[...] he was engaged in trivial conversation;
then [...] stimulus lights were turned on. They
[...] Sylvania R11 Glow-modulator tubes
[...] 1% in. green fresnel lenses placed [...]
an opalescent Lucite screen. The lights
[...] off and on 180° out of phase. The
rate could be varied from 2.5 to 12.5 cps
[...] motor-driven sweep, the sweep time being
[...]







Table 1

Mean and Standard Deviations

Group             Mean         Standard Deviation

Chlorpromazine    5.215 cps    1.76 cps
Untreated         3.475 cps    .576 cps


When the lights were switched on at 2.5 cps, the 
usual report was that of a greenish light swinging 
back and forth. The sweep drive was then turned 
on and the S asked to report what he saw and to 
tell the experimenter if there were any changes. When 
the S reported a black object in motion, indicating 
a loss of perception of apparent beta movement, 
the sweep was stopped and the alternation rate read 
from a calibrated dial.


Results 


The flash rate means and standard devia- 
tions are presented in Table 1. The results 
were analyzed by the Mann-Whitney (Sie- 
gel, 1956, pp. 127) test for group differences. 
The investigator is no longer willing to use 
parametric tests in this area because of the 
high incidence of correlated means and
variances.

For the two groups, a Mann-Whitney 
x/sigma ratio of 5.47 without correction for 
tie scores was obtained. This is equivalent to 
a CR of 5.47, when N is as large as 38. Tie 
correction may have raised the CR slightly, 
but since the CR of 5.47 would occur by 
chance only one time in fifty thousand, tie 
correction was deemed needless. 
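
The large-sample form of the Mann-Whitney test referred to above can be sketched as follows. This is an illustrative reconstruction under assumptions, not the author's worksheet: the threshold values are hypothetical, the function names are introduced here, tied scores receive average ranks, and, as in the text, the standard deviation of U carries no correction for ties.

```python
import math


def midranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def mann_whitney_z(group_a, group_b):
    """Return U for group_a and its normal deviate (no tie correction)."""
    n1, n2 = len(group_a), len(group_b)
    ranks = midranks(list(group_a) + list(group_b))
    rank_sum_a = sum(ranks[:n1])
    u_a = rank_sum_a - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    return u_a, (u_a - mean_u) / sd_u


# Hypothetical dissociation thresholds in cps (NOT the study's raw scores).
treated = [4.8, 5.5, 6.1, 3.9, 7.0, 5.2]
untreated = [3.1, 3.6, 2.9, 4.0, 3.4, 3.8]

u, z = mann_whitney_z(treated, untreated)
print("U:", u, "z:", round(z, 2))
```

A positive deviate here means the first group tends to have the higher thresholds, the direction reported for the chlorpromazine group.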

It is felt that the null hypothesis of no true 
difference between the groups can be rejected 
when the probability of obtaining such results 
by chance is this low. Therefore, the investi- 
gator is willing to accept the difference be- 
tween the groups as real and to infer that the 
chlorpromazine medication can account for 
the difference.


Discussion 


Chlorpromazine apparently raises the thresh- 
old of perception of apparent motion in the 
schizophrenic. This change is indeed so marked 
as to be significantly different when small 
groups of treated and untreated individuals 
are examined. Obviously, the information that 
one group has been treated and another has 
not is redundant, but it is of some importance
to note that a chemotherapy which is believed 
to be clinically effective does normalize at 
least one aspect of the perceptual domain of 
the schizophrenic individual. 

The results of this study confirm the con- 
clusion that chlorpromazine and lobotomy 
may produce equivalent behavior in schizo- 
phrenics. It is of interest to note that whereas 
Porteus reports a loss of ability to carry out 
planned or prehearsed behavior after both 
lobotomy and chlorpromazine, the writer’s re- 
sults indicate a facilitation of perception in 
the specious present, which James (1950, p. 
609) has defined as a holistic or Gestalt-like 
unit of time other than a point moment oc- 
cupying the temporal space between the im- 
mediate past and the immediate future. 

Perhaps the contradiction is more apparent 
than real. It is felt by most workers that the 
prime effect of chlorpromazine is a dulling 
of affective processes, probably through the 
hypothalamus and reticular formation. Some 
functions, however, require an affective com- 
ponent. Simple perceptual organizations in the 
specious present may be hampered by affec- 
tive components, either in the form of anx- 
iety or in the form of over-loaded neural mat- 
rices bound to schizophrenic emotional proc- 
esses, whereas planned future behavior or pre- 
hearsal is a relatively sterile concept without 
the inclusion of an affective component as 
motivation. Thus, the results of both the pres- 
ent study and the maze study may be sub- 
sumed within a frame of reference with a core 
concept of schizophrenia as functional de- 
cortication (Saucer, 1958). 

Perceptual processes in the specious present
may be facilitated while the capacity for
prehearsal may be depressed by removing the
affective component, either by surgical or
chemical intervention.

As a short final speculation, it may be in- 
teresting to consider the implication of this 
statement for the process of psychotherapy. 
It is generally accepted that chlorpromazine 
and other ataractic drugs can facilitate psy- 
chotherapy. This may be explained in part 
by the assumption that the individual is more 
accessible and perceptually sharper, but since 
a large part of the process of psychotherapy is 
based on prehearsed or planned future behavior,
it would seem that while the use of
chlorpromazine may well facilitate supportive 
therapy, its use may interfere with a long- 
range therapeutic program based on the mo- 
tivational structure of future social and per- 
sonal gains for the patient. 


Summary 


A group of 20 untreated schizophrenics has 
been compared with a group of 18 schizo- 
phrenics receiving chlorpromazine. It has been 
established that the chlorpromazine group has 
a significantly higher threshold for apparent 
motion dissociation, their perceptual processes 
becoming normalized. 

Experimentally, a program of testing before 
and after medication could be recommended, 
but since both previous studies (Saucer & 
Deabler, 1956; Saucer, 1958) have indicated 
a consistent deficit in the perception of ap- 
parent movement by schizophrenics, it is felt 
that the results obtained in this study can 
allow generalization on a basis of a treated- 
nontreated division of subjects. 




It is believed that these results are con- 
sistent with a theory of schizophrenia as a 
process of functional decortication. They are 
discussed with regard to Porteus Maze test 
results with schizophrenic patients, and it is
concluded that the results of the two studies 
are complementary rather than mutually ex- 
clusive. 


NOTE ON A BIBLIOGRAPHICAL RESOURCE FOR
STUDIES OF AFTERIMAGES 


JEROME D. SCHEIN 


University of Wisconsin 


Since Freeman and Josey (1949) published 
their findings, clinical psychologists have be- 
come increasingly interested in the afterimage 
of apparent movement as induced by the 
Archimedes (or Plateau) spiral. In studying 
this topic I have been struck by the seeming 
lack of research on the duration of the after- 
image. Current research in the clinical area 
has been concerned only with having the sub- 
ject report the presence or absence of the 
apparent movement following stimulation 
(Aaronson, 1958; Gallese, 1956; Price &
Deabler, 1955). No investigation of the possible
differences in duration of the afterimages
between clinical samples has been reported. 


The recent study by Eysenck, Holland, and 
Trouton (1957) contains in its bibliography, 
but does not discuss in its text, a doctoral 
dissertation by Wohlgemuth (1911) which 
was completed in 1910. Wohlgemuth’s investi- 
gations encompass five years of experimenta- 
tion, during which time he performed 34 
experiments which he reports and 11 addi- 
tional studies which he does not give in detail 
since they represent confirmations of previous 
workers’ results. In much of his research, 
duration of the afterimage is used as the de- 
pendent variable. Among the topics he covers 
are (a) the effect of color, (b) duration of the
afterimage after repeated stimulations, (c) effect
of velocity of stimulus, (d) differences in
afterimage following counterclockwise and clockwise
rotations of the stimulus, and many others. 
The importance of this reference to research- 
ers in this area is manifest. 

Since the standard reference resources con- 
sulted, Boring (1942), Osgood (1953), and 
Stevens (1951), do not list this monograph 
in their bibliographies, it seems desirable to
call to the attention of other investigators the
work already accomplished by Wohlgemuth 
if it is not to be overlooked. The first edition 
of Woodworth (1938) does list this study in 
the bibliography, but a careful reading of the 
relevant sections in the body of the text does 
not turn up any mention of Wohlgemuth’s 
thesis. Thus, though the monograph is listed 
in two bibliographies, neither source gives 
any indication of the value of this reference. 

A brief quotation from the conclusion
of Wohlgemuth’s dissertation (1911) should
stimulate further the interest of clinicians in 
this study: 

To find the exception that proves the rule we have,
as far as I can see, to look to pathology for assistance;
nature has to perform the “experimentum
crucis” herself. It may be argued, I think, that if
there is a lesion in this theoretical “centre of movement,”
the after-effect ought to be impaired under
some conditions, possibly intensified under others.
For this purpose it would be necessary to examine
a great number of cases, especially those where there
is some lesion affecting the optic tract. I began
examinations on these lines some time ago, but,
owing to circumstances, could not pursue them far
enough (pp. 114-115).


THE RELATIONSHIP BETWEEN RECALLED FORMS OF
CHILDHOOD DISCIPLINE AND PSYCHOPATHOLOGY


NORMAN S. GREENFIELD


University of Wisconsin Medical School


In recent years, many articles have appeared
in both the lay and professional literature
with the primary intent of advising parents
to stop worrying so much about pseudo-expertness
in parenthood. In general, the
drift of informed opinion, vintage 1958, is
in the direction of minimizing the rigidity of
old authority, accentuating the fallibility of
contemporary authority and, in essence, urging
the anxious parent to relax and enjoy herself
(himself) with her children. Doing what
comes naturally is the current fashion and
has been advocated by such spokesmen as
Spock (1946), Baruch (1949), and Bruch
(1953).

The advocacy of the principle of “doing
what comes naturally” with one’s children is
but a single facet of a much larger problem
relating to the inhibition of feeling expression
in general. There is a growing body of clinical
opinion which suggests that one of the
primary dimensions of psychopathology is an
inability to freely express feeling at subsequent
cost to the individual. The literature of
psychosomatic medicine is replete with illustrations
of some of the cost in terms of structural
changes within the individual which
lead to organic illness.

Rogers (1956) is one of many who have
emphasized the importance of accessibility of
feelings to the self in maintaining adjustment.
Advocates of the “doing what comes
naturally” school of child rearing practice
recognize the crucial importance of expressing
genuine feeling in the parent-child relationship.
Kanner (1944), in describing the parents
of a series of very seriously disturbed
children diagnosed as suffering from early
infantile autism, emphasized the coldness and
formality of these people and their inability
to behave in a spontaneous or demonstrative
manner.

The central focus of this paper is not on 
discipline as a learning experience. Such an 
approach is well represented in Sears, Mac- 
coby, and Levin (1957). In the present study, 
discipline is viewed as an index of the kinds 
of feeling communication which may exist 
between parent and child. Spock (1950) has 
noted that the mode of discipline expression 
may serve as a sensitive indicator of the parent’s
feelings toward the child. After espousing
the salutary effect of “an occasional
swift slap on the behind,” Spock (1950)
notes that “The inhibition against this and 
other brisk, direct, impulsive methods has 
played a part in pushing parents into more 
civilized and more damaging techniques” (p. 
76). 

Within this frame of reference the present 
study reflects an attempt to investigate the 
relationship between recalled forms of child- 
hood discipline practices and psychopathol- 
ogy. If indeed the parent-child relationship is 
crucial to personality functioning and mal- 
functioning, and if discipline practice is an 
indicator of a broader set of behaviors and 
attitudes, then the hypothesis suggested is 
that psychiatric patients will report a greater 
incidence of indirect discipline forms, while 
a control group will report a greater percent- 
age of direct expressions of discipline. 

Method 

The subjects (Ss) were 58 outpatients of
a university student psychiatric clinic whose
diagnoses ranged from relatively mild adjustment
reactions to long-standing schizophrenic
reactions. There were 26 females and 32
males. The technique of matched control was
employed and for each patient a control S
matched for sex, age, father’s occupational
level, and size of home town community was
selected from the university population. The
average age of the patient group was 22.69
years with a range from 18 to 38, while the
average for the control group was 22.48 years
with a range from 18 to 37 years. On a scale
from one to seven, the mean socioeconomic
status of the patients was 2.78 and for the
controls 2.71, indicating a generally middle-class
background. The average S from both
groups came from home town communities of
between 10,000 and 100,000 populations.
Each S was given a questionnaire which contained
nine of the usual forms of punishment
the parents employed. This questionnaire had
previously been used by MacKinnon (1938)¹
in a study which demonstrated differences in
recalled forms of discipline among violators
and nonviolators of prohibitions. In the present
study, S was asked to check the forms
most usually employed by the mother, the
father, or by both. The forms of discipline
were: being scolded or reasoned with; being
spanked; being sent away from the group;
being denied some pleasure; being made to
feel that the S is not as good as his brother,
sister, or other children; being made to feel
that the child had hurt the mother or father;
being made to feel that the child had fallen
short of what was expected of him; being
denied any demonstration of affection by
mother or father; and being told by the
mother or father that they cannot love the
child when he is bad.


¹ The author wishes to thank Donald W. MacKinnon
for permission to use the questionnaire which
he designed.


Table 1

Comparison of Mean Percentages for “Direct” Forms of Discipline for
58 Patients and Matched Controls

Source      Patients (Mean, SD)      Controls (Mean, SD)

26.40  25.49  38.37  21.92  20.76  33.75  26.65  40.29  30.00

* Two-tailed test.


These items fall roughly into two groups
of treatments: the first four items represent
relatively direct, “physical,” unambiguous
means of discipline expression, while the
second group represents a more subtle expression
of “psychological” discipline. Here the
emphasis falls upon the arousal of guilt feelings
in the child and the expression of disappointment
or denial of love by the parent.
For purposes of the present study the first
group was thought of as “direct” discipline
practice, while the second group was labeled
“indirect” discipline practice.


Results 


In evaluating the results it was first thought 
necessary to establish the fact that any dif- 
ference found would be a function of differing 
emphases in method rather than of total num- 
ber of items responded to, and thus a com- 
parison of total responses for both groups was 
made with the finding that no significant dif- 
ferences exist. 

The percentage of the reported forms of 
“indirect” discipline was computed for each
S, and between-group differences for the various
parent-child relationships were analyzed.
The results permitted rejection of the null 
hypothesis in all instances except in the case 
of fathers of girls wherein no significant dif- 
ference was found (see Table 1). 
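
The per-subject percentage described above can be sketched in a few lines. This is only an illustration under assumptions: the items are numbered 1 to 9 in the order in which the forms of discipline are listed in the Method section (the first four taken as “direct,” the remaining five as “indirect”), the example responses are hypothetical, and the function name is introduced here. The significance test applied to the group differences is not specified in the text and is not reproduced.

```python
# Item numbers follow the order of the forms listed in the Method section
# (an assumption made for this sketch).
DIRECT_ITEMS = {1, 2, 3, 4}        # scolded, spanked, sent away, denied pleasure
INDIRECT_ITEMS = {5, 6, 7, 8, 9}   # guilt, disappointment, denial of love


def percent_indirect(checked_items):
    """Percentage of a subject's checked forms that are 'indirect'.

    Returns None when the subject checked no forms at all, since a
    percentage is undefined in that case.
    """
    checked = set(checked_items)
    unknown = checked - DIRECT_ITEMS - INDIRECT_ITEMS
    if unknown:
        raise ValueError("unknown item numbers: %s" % sorted(unknown))
    if not checked:
        return None
    return 100.0 * len(checked & INDIRECT_ITEMS) / len(checked)


# Hypothetical responses for one subject, by parent (not data from the study).
responses = {"mother": [1, 2, 6], "father": [2, 4]}
for parent, items in responses.items():
    print(parent, percent_indirect(items))
```

Because the score is a proportion of the forms checked, it is not affected by how many forms a subject checked in total, which is the concern addressed by the preliminary comparison of total responses.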


Discussion 


It should be emphasized that no known se- 
lective factor influenced the choice of experi- 
mental Ss other than their status as psychi- 
atric outpatients. For a period of several 
weeks the questionnaire was administered to 
all patients coming to the clinic. 

At first sight, the data which indicate that
the Ss in both groups reported that their par- 
ents relied more on physical than indirect 
discipline seem at odds with the generally 
held belief that parents of middle-class chil- 
dren use less physical discipline than do the 
parents of the lower class. This may well be 
the case in an absolute sense, but with re- 
gard to the ratio of “direct” to “indirect” 
discipline the present results are entirely con- 
sistent with a previous study (MacKinnon, 
1938) which utilized the same questionnaire 
with a group of Harvard students. The data 
presented by Sears, Maccoby, and Levin 
(1957) on the reported use of physical disci- 
pline add further confirmation to the impres- 
sion that the present findings in this regard 
are probably representative. 

To the extent that we can assume a rela- 
tionship between recalled and practiced disci- 
pline, what are some of the implications of 
these results? 

Implicit in the hypothesis is the belief that 
direct and indirect forms of discipline are re- 
lated to more general modes of direct and in- 
direct expressions of feeling or, more properly, 
to the relative ability or inability of the par- 
ent to express genuine feelings toward the 
child. It is assumed that the inability to come 
to terms with one’s own feelings regarding 
discipline-provoking situations is but one as- 
pect of a more generalized emotional sup- 
pression. It is this inhibition of expression 
which may be transferred to the child and 
thus give rise to emotional difficulties. In ef- 
fect, the child learns to respond as he has 
been responded to. 

The concept of limits becomes relevant in 
terms of the mechanism of this learning process.
The setting of limits by the parent in the
direct method frees the situation of much of 
its fearful ambiguity and permits the child to 
express feelings which could not be expressed 
in a less well defined context which neces- 
sarily is fraught with all the dangers of the 
unknown. Phenomenologically, the child must 
first know where he stands before he can per- 
mit himself the risk. 

Another factor in this learning not to ex- 
press feelings is what Baldwin (1955) has 
called the parental technique of visible suf- 
fering. Baldwin pointed out that what Mac- 
Kinnon (1944) called “psychological” disci- 
pline and what the present investigator termed 
“indirect” discipline is, in fact, the parental 
expression of pain and shame which has a 
guilt-inducing effect on the child. In the pres- 
ent formulation we need take this only one 
step further and point out that the guilt feel- 
ings become prominent inhibitors of feeling 
expression. 

This interpretation does not imply a single 
cause for all the behavior pathology encoun- 
tered at the clinic wherein the study was car- 
ried out. However, the present formulation is 
presented with the belief that inhibition of 
affective expression is a common nuclear cor- 
relate. Studies are currently in progress to 
test this relationship with regard to the re- 
called expression of parents’ positive feelings. 

The problem raised by the inconsistency in 
the results with regard to fathers of females 
will be left unanswered. It is tempting to dis- 
cuss some of the possible implications of the 
cultural role of the father of the female in 
this context, but these pages have little room 
for sheer speculation. 


Summary 


The hypothesis that psychiatric patients 
will report a greater incidence of indirect or 
“psychological” forms of childhood discipline, 
while a control group will report a greater 
percentage of direct or “physical” discipline, 
was tested in a university population. The re- 
sults indicate general confirmation of the hy- 
pothesis. Theoretical implications in terms of 
viewing methods of discipline as behavior 
samples representative of more generalized 
attitudes of feeling expression in the parent- 
child relationship are discussed. 


Received March 13, 1958. 


FUNCTION OF MATERNAL PERSONALITY 1, 2 


Considerable research and speculation con- 
cerning the effects of infant feeding practices 
on later personality development of the child 
have been reviewed by Orlansky (1949). He 
concludes that the indefinite and occasionally 
contradictory results of these studies indicate 
that the total maternal personality, rather 
than a specific feeding technique, should re- 
ceive the emphasis in studies on personality 
development of the infant. It is the total per- 
sonality of the mother, together with the 
standards and sanctions of her environment, 
which will determine her choice of breast or 
bottle feeding and the subsequent positive or 
negative effects thereof. It should be profit- 
able, therefore, to consider the mother’s choice 
of infant feeding technique as a specific be- 
havioral manifestation of certain dynamic per- 
sonality traits, and to attempt to identify 
those personality characteristics which may 
be related to her feeding preferences. 


1 This article is based upon a section of a disserta- 
tion submitted to the faculty of the Graduate School 
of Columbia University in partial fulfillment of the 
requirements for the Ph.D. degree in the graduate 
program in Social Psychology; doctoral committee: 
Otto Klineberg, Rosalea Schonbar, and Goodwin 
Watson, to whom the author is most grateful. 

The writer is also greatly indebted to Lewis Bern- 
stein, Chief Clinical Psychologist, VA Hospital, Den- 
ver, Colorado, and to Patilu Farquharson, University 
of Denver, for their help and encouragement. Ac- 
knowledgment is also due Paul Bruns and E. Stewart 
Taylor of the Department of Obstetrics and Gyne- 
cology, University of Colorado Medical School, Den- 
ver, for their kindness in making available the facili- 
ties of The Colorado General Hospital Obstetrical 
Out-Patients Department for this research. 

2 Essentially this read at the 
American meeting, New 
York, 1957 

3 Now affiliated with the Jewish National Home 
for Asthmatic Children, Denver, Colorado 

Very little work has been done in this area, 
however. Reports by Deutsch (1944, 1945), 
Kroger and Freed (1951), and Newton 
(1955) suggest that the satisfactory psycho- 
sexual and sex role adjustment of the mother 
may be an important factor in her desire and 
ability to lactate successfully. Sears, Mac- 
coby, and Levin (1957), in their recent ex- 
tensive study of child rearing techniques, in- 
vestigated infant feeding behavior in a large 
sample of American mothers. They found 
that for many of these women breast feeding 
had a sexual implication, and that those who 
“have a strong sense of modesty or anxiety 
about sex in general, may avoid breast feed- 
ing” (p. 77). The authors also found that in 
terms of average group performance, feelings 
of rejection of the infant, coldness toward it, 
or the mother’s lack of self-esteem did not 
characterize those who bottle fed their in- 
fants. Levy (1943) reports that rejecting 
mothers may be more inclined to bottle feed 
their children, but that overprotected chil- 
dren may be breast fed for an inordinately 
long time. 

In general, however, work in this area has 
not specifically dealt with choice of infant 
feeding technique in the prenatal period. In- 
formation concerned with psychological fac- 
tors related to mothers’ choice of infant feed- 
ing technique has been based primarily on 
interviews with groups of mothers at some 
time after the child’s birth or on clinical in- 
sight and speculation. 

It is the purpose of this study to investi- 
gate the relationship of prenatal choice of in- 
fant feeding technique to maternal person- 
ality and, more specifically, to maternal psy- 
chosexual disturbances. 


Method 


Fifty-eight married, Caucasian women in 
the last trimester of their first pregnancy 
were used as subjects (Ss). They were of 
lower socioeconomic class status, and all were 
patients at the Obstetrical Out-Patient De- 
partment of Colorado General Hospital at 
Denver. They received, through the hospital, 
approximately the same reading material con- 
cerning infant care, and the same general 
information concerning lactation, care of 
breasts, etc. All were felt to be physiologi- 
cally capable of breast feeding by the medi- 
cal personnel of the clinic. Ss participated in 
the study on a nonvolunteer basis. 

Ss were divided into two groups: those 
stating prenatally a desire to breast feed their 
infants (N = 35), and those stating pre- 
natally a desire to bottle feed their infants 
(N = 23). No significant differences were 
found between groups with respect to age of 
S, age of husband, sibling status of S or hus- 
band, educational level of S or husband, dura- 
tion of courtship, length of marriage, or 
whether or not this pregnancy was deliber- 
ately planned. 

Each S was interviewed for approximately 
one hour. The exact nature of the study was 
not revealed to the Ss. The interview was es- 
sentially nondirective, and E asked direct 
questions only to assure that the same sub- 
ject matter was covered for all Ss. Informa- 
tion obtained was concerned primarily with 
response to and attitudes concerning sex role, 
marriage, pregnancy, childbirth, and related 
areas, as well as historical material concerned 
with early family life. 

Where data were primarily of an interpre- 
tive nature, two or three clinical psychologists 
and the investigator classified the responses 
into appropriate categories. Only those areas 
on which there was virtually unanimous agree- 
ment were utilized in the preparation of the 
data. Differences between the groups were 
compared by means of the chi square test. 

Each S was subsequently given the Blacky 
test as part of a larger battery of tests. This 
material was scored quantitatively by the 
method suggested by Blum (1951). Statisti- 
cal significance was calculated for obtained 
differences, using the chi square technique. 
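
The chi square comparison of the two choice groups can be illustrated with a minimal sketch. It assumes a 2 x 2 table (group by disturbance present or absent) and the uncorrected Pearson statistic; the article does not say whether a continuity correction was applied. The function name `chi_square_2x2` and the cell counts are hypothetical; only the group sizes of 35 and 23 come from the text.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table
    [[a, b], [c, d]]: rows are groups, columns are present/absent."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be positive")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Tabled chi-square value for p = .05 with 1 degree of freedom.
CRITICAL_05_DF1 = 3.841

# Hypothetical counts for illustration only (NOT the article's data):
# breast choice group, 10 present and 25 absent (N = 35);
# bottle choice group, 15 present and 8 absent (N = 23).
chi2 = chi_square_2x2(10, 25, 15, 8)
significant = chi2 > CRITICAL_05_DF1
```

With these invented counts the statistic exceeds the .05 critical value for one degree of freedom, which is the kind of comparison summarized by the significance footnotes of Table 1.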


Results and Discussion 


Table 1 indicates that Ss choosing to bottle 
feed their infants were significantly more dis- 
turbed on the Blacky test dimensions of oral 
eroticism, oral sadism, anal expulsiveness, 
positive identification, sibling rivalry, guilt 
feelings, and positive ego ideal. Breast choice 
Ss were significantly more disturbed on the 
penis envy dimension. 


Table 1 

The Blacky Test: Differences Between Breast and Bottle Choice Groups 

Columns: Dimension; Breast Choice Group (N = 35), Present and Absent; Bottle Choice Group (N = 23), Present and Absent. 

Dimensions: Oral eroticism; Oral sadism; Anal expulsive; Anal retentive; Oedipal intensity; Masturbation guilt; Penis envy; Positive identification; Sibling rivalry; Guilt feelings; Positive ego ideal; Narcissistic love object; Anaclitic love object. 

[Cell frequencies and significance markers are not legible.] 

Note. All significant differences obtained indicate that a greater proportion ... choosing mothers are disturbed on the relevant test dimensions. 

a Significant at or beyond .05 level. 
b Significant at or beyond .01 level. 
c Significant at or beyond .001 level. 


Infant Feeding Technique and Maternal Personality 


Table 2 

Differences Between Breast and Bottle Choice Groups on Interview Material 

Columns: Breast Choice (Yes, No); Bottle Choice (Yes, No). 

Rows: Dependency; Response to emotional pressure of pregnancy dependent/immature; Premarital conception; Domiciled with parents or parent-surrogates in addition to ...; Rejection of child/pregnancy; “Mother-centered” rather than ... reasons for choice of feeding; Reasons for desiring rooming ...; ... toward child. 

[Cell frequencies are not legible.] 

Table 2 presents data from the interview. 
These data suggest relatively greater depend- 
ency, rejection of the child, and conscious sex 
role dissatisfaction on the part of the bottle 
choice Ss. 

For the purposes of this discussion it is as- 
sumed the Blacky test is a valid measure of 
those dimensions of psychosexual disturbance 
indicated. If this assumption is granted, then 
it would appear that the psychoanalytic in- 
terpretation of these test results reflects, on a 
psychodynamic level, those characteristics as- 
sociated with the bottle choice Ss on the basis 
of the interview material. 

That is, the orality of the bottle choice Ss 
may be related to the dependency character- 
istics of that sample population, since the 
classic “oral character” is described as ex- 
ceedingly dependent and immature (Fenichel, 
1945, p. 497 ff.); the anal expulsiveness of 
this group may likewise be associated with 
their relatively greater rejection of the child, 


Breast Choice 


Bottle Choice 


Yes No Yes No 


anal expulsiveness being theoretically indica- 
tive of the concern of an individual with the 
thrusting-out and destruction of foreign ma- 
terial within the body (Fenichel, 1945, p. 
66 ff.); and the tendency of the bottle choice 
Ss to state less satisfaction with their sex role 
may be reflected on the psychodynamic level 
by their disturbance on the dimensions of 
positive identifications and ego ideal, as well 
as on their relatively more frequent choice of 
a narcissistic love object. These disturbances 
might well be etiologically related to an in- 
ability to adjust to a given sex role. 

It is also interesting to note the relatively 
greater narcissism of the bottle choice Ss 
within the framework of our cultural empha- 
sis on the breasts as indicators of feminine 
desirability. The narcissistic woman might 
well be concerned with “preserving” her 
sexual desirability and bodily attractiveness 
through negating the functional use of her 
breasts, or refusing to chance an undesirable 
alteration in their contours. 

In addition to the above differences, bottle 
choice Ss were also relatively more disturbed 
on sibling rivalry. According to psychoana- 
lytic theory there is a strong relationship be- 
tween oral eroticism and sibling rivalry and 
jealousy. Fenichel states, “. . . the sight of 
a younger brother or sister at the breast often 
turns out to be the underlying factor in con- 
necting envy and jealousy with oral eroticism” 
(Fenichel, 1945, p. 490). Another sib taking 
away the child’s place at the mother’s breast 
is frequently the motivation for sibling ri- 
valry; envy is frequently linked with oral ac- 
tivity (Alexander, 1945, p. 50). Disturbance 
on the sibling rivalry dimension, therefore, 
might be directly related to the high orality 
of the bottle choice Ss. 

The relatively greater disturbance on “guilt 
feelings” may possibly be related to the 
higher incidence of premarital pregnancy in 
the bottle choice group, or to their ambivalent 
feelings toward the coming child and toward 
their sex role which they have been unable 
to resolve. 

The fact that the women scoring as more 
disturbed on the penis envy dimension should 
be in the breast choice group might be re- 
lated to the psychoanalytic hypothesis that 
in the course of her psychosexual develop- 
ment, and specifically in her adaptation to 
the genital trauma, the young girl displaces 
her desire for a penis to the desire for a child 
(Fenichel, 1945, p. 90). It is possible that 
the bottle choice S may seek satisfaction in 
the pregnancy, which increases her ability to 
act out her dependency needs, but may reject 
the child. The breast choice S, on the other 
hand, seeks satisfaction in the child itself 
which may become a factor in the resolution 
of her genital trauma. Breast feeding may 
also serve as an acting-out mechanism for the 
breast feeding mother, whereby the passive 
nonfunctional breasts become active and func- 
tional. 

It is to be emphasized that the incidence 
of overlapping between groups on all dimen- 
sions is such as to preclude the possibility of 
generalizing the obtained results to a given 
individual case, solely on the basis of her 
choice of breast or bottle feeding. Further, it 
should be borne in mind that these results 
have not been cross-validated. 


Summary 


The goal of the present study was to at- 
tempt to determine the relationship of pre- 
natal choice of breast or bottle feeding to 
certain maternal personality variables. Fifty- 
eight primiparous clinic outpatients were used 
as subjects. Two groups were designated: 
breast choice (N = 35) and bottle choice 
(N = 23). Groups were homogeneous with 
respect to a number of hypothetically perti- 
nent variables. 

Subjects were interviewed and subsequently 
given the Blacky test. On the basis of these 
measures it would appear that the subjects 
choosing to bottle feed their infants were 
more dependent, rejecting of the child, and 
dissatisfied with their sex role, and that they 
had more psychosexual disturbance. The bottle 
choice group was significantly more disturbed 
on orality, anal expulsiveness, positive identi- 
fication, sibling rivalry, guilt feelings, positive 
ego ideal, and narcissism as measured by the 
Blacky test. The breast choice group was sig- 
nificantly more disturbed on the penis envy 
dimension. Overlapping between groups was 
present in all measures, and these results are 
not automatically generalizable to any given 
single case. 

These findings were discussed in relation to 
psychoanalytic theory. 
Received March 17, 


DRAW-A-MAN TEST 1 


In a study reported in 1949, Fiedler and 
Siegel (1949) demonstrated that it was pos- 
sible to identify a significant proportion of 
untreatable neurotic patients in their agency 
on the basis of an objective score derived from 
the subject’s drawing of a person. Using scor- 
ing criteria from the Goodenough Intelligence 
Test, they found that adult male patients 
whose drawings were deficient with respect to 
representation of the head and face did not, 
in most cases, benefit from psychotherapy. 

Two features of the Fiedler and Siegel 
study, the adaptation of an objective scoring 
system to a projective test and the prediction 
from this score of a meaningful behavioral 
criterion, appear to the present writers to be 
unique contributions which warrant efforts at 
their further understanding and application. 

Fiedler and Siegel (1949) interpreted their 
finding as follows: 

It is suggested that if Machover’s hypothesis that the 
face is the social feature of the drawing be accepted, 
the poor performance in drawing the face is indica- 
tive of inability to form that interpersonal relation- 
ship between patient and therapist which is the 
necessary context of the therapeutic process (p. 389). 


1 The authors wish to acknowledge the assistance 
of Richard Schiefelbusch and the staff of the Bu- 
reau of Child Research, University of Kansas, in 
the collection of the data and for the use of data 
previously collected by the Bureau. 

2 Then at the University of Kansas. 


This formulation involves two related hy- 
potheses: (a) that the face is superior to 
other parts of the drawing as an index of 
social aspects of personality, and (b) that the 
demonstrated relationship between represen- 
tation of the face and outcome of therapy is 
a function of the patient’s inability to form 
interpersonal relationships. In the Fiedler and 
Siegel study, the first of these hypotheses is 
not explicitly tested. In preliminary sorting 
of the drawings for primitiveness of execu- 
tion, judges were in good agreement, “agree- 
ment seeming to center around the face as the 
differentiating area” (p. 387), but quantita- 
tive data were not collected. The present in- 
vestigation attempts a direct test of the hy- 
pothesis by comparing the relative efficiency 
of a “face” score and a “body” score as 
predictors of the dependent variable to be 
studied. 

The second hypothesis implicit in the Fied- 
ler and Siegel formulation, i.e., that represen- 
tation of the face is an index of the subject’s 
ability to form relationships, is offered by 
the experimenters as the most likely theoreti- 
cal interpretation of their findings. The pres- 
ent writers interpret “ability to form relation- 
ships” as a hypothetical construct of which 
nonimprovement in therapy is a negative 
indicator. The ability to form relationships 
would appear, however, to have relevance to 
other criteria as well. If there is a general 
ability to relate to others, it should have im- 
plications for behavior in many types of social 
situations. Similarly, if performance in draw- 
ing the face is an index of ability to form 
relationships, then this performance should 
predict a variety of criteria of interpersonal 
effectiveness. The validity of both the con- 
struct and of the face score would be better 
established by evidence that performance in 
drawing the face is related to more than one 
behavioral criterion. This study, therefore, 
investigates the relationship of performance 
in drawing the face to the criterion of soci- 
ometric status or popularity. It is reasoned 
that an individual who is unable to form 
relationships with others usually will not be 
well liked by others who know him; his in- 
ability to accept others will make it difficult 
for them to accept him. The selection of this 
criterion is not intended to suggest that the 
ability to form relationships is equivalent to 
popularity. An individual might be popular 
without being able to accept others or, con- 
versely, one might accept others without be- 
ing popular. It was desired, however, to 
employ as objective a criterion as possible. 
Popularity can be measured objectively and 
at the same time would appear at least to be 
relevant to the construct. 

The specific hypotheses to be tested are as 
follows: (a) that performance in drawing the 
face of the human figure is significantly re- 
lated to popularity, and (b) that performance 
in drawing the face is a better index of popu- 
larity than is performance in drawing the 
body. 

Method 


Subjects 


Ss were 103 children representing the entire 
enrollment, less absentees, of five fifth-grade 
classrooms in one small Midwestern city. This 
number represents about half the total enroll- 
ment of the fifth grade in the community. 
The group comprises 52 boys and 51 girls. 
The mean IQ (California Test of Mental 
Maturity) is 105.03, with a range of 65 to 
147 and a standard deviation of 17.35. The 
mean age is 10.29 years, with a range of 9.25 
to 11.75 years. The socioeconomic status of 
the children was not studied on an individual 
basis, but the schools in which the classes 
are located are situated in a variety of resi- 
dential areas, ranging in the writers’ judg- 
ment from the poorest to a level above aver- 
age for the community. 


Procedure 


Two figure drawings, one of a male and one 
of a female, were obtained from each child 
in group test administrations. All drawings 
were made on 8½” by 11” smooth ditto paper 
with IBM mechanical pencils. An eight-min- 
ute time limit was set for each drawing, pilot 
studies having shown that this amount of 
time was adequate for all pilot Ss to complete 
a drawing. The children were instructed to 
draw the figure facing forward “so that you 
can see his whole face” and to draw the 
entire figure including head, body, arms, and 
legs. They were also told, as in the Good- 
enough instructions (1926, p. 85), to “draw 
the best picture you can.” 

Current theory assumes that when a nor- 
mal S draws both a male and a female figure, 
he tends to identify more with the figure of 
his own sex and to devote more care to its 
production. In the data analysis, therefore, 
the figure of the same sex was utilized in each 
case. The decision to use drawings of females 
for the girl Ss necessitated some slight modi- 
fications in scoring, since the Goodenough 
test was standardized on drawings of males.3 

The drawings were scored by one of the 
authors without knowledge of other data 
about the Ss. Three scores were obtained for 
each S as follows: (a) a face score, based on 
the 22 Goodenough items relating to the head 
and neck, (b) a body score, based on the 
remaining 29 items of the scale, and (c) a 
total Goodenough score, the sum of (a) plus 
(b). The 22 items of the face scale, as used 
by Fiedler and Siegel, are reproduced in ab- 
breviated form as follows: 


1 Head present: there must be an outline of the head. 
6a Neck present: any clear indication of neck as distinct from head. 
6b Outline of neck continuous with that of head, trunk, or both. 
7a Eyes present. 
7b Nose present. 
7c Mouth present. 
Both nose and mouth shown in two dimensions; two lips shown. 
Nostrils shown. 
Hair shown. 
Hair present on more than the circumference of head, better than a scribble. Nontransparent. 
Proportion of head area not more than ½, nor less than 1/10 of trunk. 
All lines firmly drawn with correct joining.4 
Head outline without obviously unintentional irregularities. More than crude circle or ellipse. 
14f Features symmetrical in all respects: eyes, nose, and mouth in two dimensions. 
15a Ears present. 
15b Ears present in correct position and proportion. 
16a Eye detail: brow, lashes, or both shown. 
16b Pupil shown. 
16c Eye detail: proportion: horizontal greater than vertical (except on profile). 
16d Eye detail: glance (in profile).5 
17a Both chin and forehead shown. 
17b Projection of chin shown, chin differentiated from lower lip. 


3 In the case of females dressed in skirts, one of 
the items (11b) could not be scored, and judgments 
had to be made on the intended length of the legs 
in order to score two others (12c and 12d). Two 
additional items refer to attachment or nonattach- 
ment of the legs to the trunk, but this point could 
be scored by extending a line through the part of the 
leg represented. There were, therefore, two of 51 
items on the female drawings in which scoring was 
more subjective than on drawings of males, and one 
item which was usually unscorable for girls. 


Popularity and Goodenough Draw-a-Man Test 


A popularity measure was obtained from a 
sociometric technique. The child’s score was 
based on the number of times he was chosen 
by his classmates in response to the following 
questions: 

1. Who would you say are the three best friends you have in this class? 

2. If you were going to have a party, which three children from this class would you be sure to invite? 

Since there were unequal numbers of children 
in the five classrooms, the popularity scores 
were converted to standard scores to permit 
pooling them in analysis of results. 
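
The conversion to standard scores can be sketched as follows. This is a minimal illustration, assuming that each child's raw choice count is expressed as a z score relative to the mean and standard deviation of his own classroom; the article does not state which standard deviation formula was used, so the population form is an assumption. The classroom labels and counts are hypothetical.

```python
from statistics import mean, pstdev


def standardize_within_class(choices_by_class):
    """Convert raw choice counts to z scores separately for each classroom."""
    z_by_class = {}
    for classroom, counts in choices_by_class.items():
        m = mean(counts)
        sd = pstdev(counts)  # population SD (assumed); sample SD is equally plausible
        if sd == 0:
            raise ValueError(f"no variation in classroom {classroom!r}")
        z_by_class[classroom] = [(x - m) / sd for x in counts]
    return z_by_class


# Hypothetical raw counts (number of times each child was chosen).
raw = {"room_a": [2, 4, 6], "room_b": [1, 5, 9, 13]}
z = standardize_within_class(raw)

# After standardization the classrooms share a common scale and can be pooled.
pooled = [score for scores in z.values() for score in scores]
```

Because every classroom then has a mean of zero and unit spread, a child chosen often in a small class is not penalized relative to a child in a large class.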

Because the full-length Goodenough is used 
as a children’s intelligence test, and intelli- 
gence is known to be a correlate of popularity, 
an independent measure of intelligence was 
considered desirable. A group intelligence test, 
the California Test of Mental Maturity, was 
therefore included in the analysis. 

Product-moment correlation coefficients 
were calculated among all the variables stud- 
ied, i.e., the three Goodenough scores, IQ, 
and popularity. 
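
For readers who wish to reproduce this kind of analysis, a product-moment coefficient can be computed as in the sketch below. The function name `pearson_r` and the sample scores are hypothetical and serve only to show the computation; they are not data from the study.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)


# Hypothetical face scores and standardized popularity scores for five children.
face = [10, 12, 14, 11, 15]
popularity = [-0.5, 0.1, 0.8, -0.9, 0.5]
r = pearson_r(face, popularity)
```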


Results 


Scores on the three Goodenough measures 
are presented in Table 1. It will be noted 
that face items account for about 43% of 
the total score. 

Intercorrelations among measures are pre- 
sented in Table 2. The correlations obtained 
support both experimental hypotheses. A re- 
lationship significant beyond the .01 level is 
found between face score and popularity. No 
significant relationship, however, is discovered 
between body score and popularity. 


4 This item could as justifiably be classified with 
the body scale, but it is retained here because it was 
included in the face items by Fiedler and Siegel. 

5 Not applicable here since Ss were instructed to 
make full-face drawings. 


Table 1 

Means and SDs of Goodenough Measures 

Score            Mean     SD 
Face             12.30    ... 
Body             16.58    ... 
Face and body    28.88    ... 


Discussion 


It will be noted that the relationship be- 
tween popularity and IQ (California Test of 
Mental Maturity) is about the same as that 
between popularity and Goodenough face 
score. To determine possible effects of intelli- 
gence on the latter correlation, a partial r was 
calculated in which IQ was held constant. 
The resulting coefficient of .22, significant 
beyond the .05 level, indicates that a rela- 
tionship between face score and popularity 
still remains after their mutual association 
with intelligence (as measured by the Cali- 
fornia) is taken into account. The objection 
might still be raised that the Goodenough 
face score is simply measuring another aspect 
of intelligence, i.e., nonverbal, visual-motor 
ability, in contrast to the abilities measured 
by the California. This explanation would not, 
however, account for the fact that the Good- 
enough body score, also a nonverbal, visual- 
motor test and about equally related to Cali- 
fornia IQ, is not associated with popularity. 


Table 2 

Intercorrelations Among Measures 

Measure          Body     Face and body   IQ     Popularity 
Face             .689*    ...             ...    .304* 
Body                      .902*           ...    .053 
Face and body                             ...    .180 
IQ                                               .299* 

* Significant at the .01 level. 
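
The partial r reported in this Discussion can be illustrated with the standard first-order partial correlation formula. The sketch below is hedged: the face-popularity (.304) and IQ-popularity (.299) coefficients are the values legible in Table 2, but the face-IQ coefficient is not legible in this copy, so the value of .36 used here is purely an assumption chosen for illustration. The function name `partial_r` is likewise hypothetical.

```python
from math import sqrt


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    denom = sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    if denom == 0:
        raise ValueError("undefined when x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denom


R_FACE_POP = 0.304  # face score with popularity (Table 2)
R_IQ_POP = 0.299    # IQ with popularity (Table 2)
R_FACE_IQ = 0.36    # ASSUMED for illustration; not legible in the source table

r_face_pop_given_iq = partial_r(R_FACE_POP, R_FACE_IQ, R_IQ_POP)
```

Under the assumed face-IQ value the result falls close to the reported coefficient of .22, which shows how a moderate shared association with intelligence reduces, but does not eliminate, the face-popularity relationship.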

The empirical relationship obtained be- 
tween popularity and performance in drawing 
the face is in the direction predicted and is 
consistent with the theory that the face score 
reflects ability to form relationships. The 
finding is therefore interpreted as offering 
some additional support for the validity of 
this construct, as well as for the face score 
as a measure of it. At least two limitations 
of this conclusion should be noted. The first 
is that the correlation of face score and popu- 
larity, though significant, is not high enough 
to warrant its use for more than gross kinds 
of predictions. While the Fiedler and Siegel 
finding would appear to have immediate clini- 
cal application, the present result has less 
practical utility. 

A second limitation relates to the question 
of construct validity. The present writers 
interpret the construct “ability to form rela- 
tionships” to refer specifically to the S’s social 
responsiveness, or ability to accept others, 
and only indirectly to his social attractiveness. 
Although the rationale of this study involves 
the assumption that social attractiveness, or 
popularity, is a function of social responsive- 
ness, it was not assumed that the two are 
equivalent. It is theoretically possible, there- 
fore, that the empirical relationship between 
face score and popularity could be explained 
without reference to the S’s ability to form 
relationships. For example, a child who draws 
the face carefully may do so because he is 
characteristically neat and/or vain of his own 
good looks. Such a child might be attractive 
to other children because of his appearance 
even though he himself was not a responsive 
person. Fiedler and Siegel, likewise, did not 
imply that their criterion, nonimprovement 
in therapy, was a perfect criterion of inability 
to form relationships. Even though both stud- 
ies yield results consistent with the theory, 
alternative explanations are possible, and ad- 
ditional evidence would have to be accumu- 
lated for adequate validation of the construct. 

The confirmation of the second hypothesis 
is interpreted as support for the Machover 
(1949) hypothesis that the face is the “social 
feature” of the drawing. As predicted, the 
face score has a higher correlation with popu- 
larity than has the body score. The question 
may be raised whether this finding could be 
a function of the particular selection of items 
in the Goodenough test. It is possible that 
other methods of scoring the body might yield 
scores with more relevance to social criteria. 
Inspection shows, however, that on the Good- 
enough test, scoring of the face and body 
items follows similar principles. For both 
parts of the drawing, the chief criteria appear 
to be detail, proportion, placement, and motor 
coordination. Thus it would appear that for 
this test it is the bodily part, not the scoring 
criteria, which accounts for the difference in 
the measures as indices of popularity. 


Summary 


This study investigated the related hypothe- 
ses (a) that performance of children in draw- 
ing the face of the human figure is related to 
popularity, and (b) that performance in 
drawing the face is a better index of popu- 
larity than is performance in drawing the 
body. 

The results, which support both hypotheses, 
are interpreted as consistent with the Mach- 
over-Fiedler-Siegel formulation that the face 
is the social feature of the drawing and that 
performance in drawing the face reflects the 
subject’s “ability to form interpersonal rela- 
tionships.” 

The need for further evidence to establish 
validity of the construct “ability to form 
relationships” is pointed out and discussed. 


Received March 17, 1958. 


SOME RELATIONSHIPS BETWEEN SOCIAL REINFORCE- 
MENT AND SCHOLASTIC ACHIEVEMENT 1 


JOHN McDAVID, JR. 
Iowa Child Welfare Research Station 


An earlier study (McDavid & Schroder, 
1957) has described several striking areas of 
individual variance in the assimilation of so- 
cial reward and punishment. Adolescent males 
were shown to differ widely in their behavior 
following both interpersonal approval and dis- 
approval statements from authority figures, 
especially in terms of their own evaluation of 
their performance after the occurrence of the 
interaction event. These results were inter- 
preted as an indication that social reinforce- 
ment (approval) and social punishment (dis- 
approval) have differential effective value for 
different individuals. 

The concept of reinforcement value has 
occurred in several phases of psychological 
literature. In Rotter’s (1954) social learning 
theory, although the precise relationship be- 
tween expectancy and reinforcement value 
has not been spelled out, there is certain 
experimental evidence that reinforcement 
value does influence the magnitude of the 
performance construct, behavior potential. 
Following Hull, Spence (1956) has also out- 
lined a mechanism whereby magnitude of 
reward is assumed to influence performance 
in certain complex learning tasks through the 
K (incentive) construct by increasing drive 
level, and a number of experimental studies 
have verified this set of assumptions. Seward
(1951) has provided a quite comprehensive
review of evidence for the motivating func-
tion of reward.

1 This research was executed while the author was
an Assistant in Research at Princeton University, and
was supported by the United States Office of Naval
Research under contract Nonr 171-055, H. M. Schro-
der, principal investigator. The author is grateful to
H. M. Schroder for his assistance and for the use of
the Situational Interpretation Test. The cooperation
of Central High School of Hopewell Township,
Pennington, N. J., in providing Ss is also gratefully
acknowledged.

The study described here is an extreme
generalization of the paradigms set forth by
both Rotter and Spence. The operations exe-
cuted in this experiment cannot adequately
test the validity of such theoretical state-
ments, but they do rest fully upon an as-
sumption that reward value is related to per-
formance by means of some kind of motiva-
tional incentive effect.

Educators have customarily made the logi- 
cal assumption that individual deviations in 
scholastic performance from the level pre- 
dicted from intelligence measures were due 
primarily to differential motivation toward 
scholastic achievement. Although other factors 
presumably operate, this study proposes to 
test the hypothesis that over- and under- 
achievement of scholastic grades is directly 
related to the reward value of high grade 
marks for the individual student. It was nec- 
essary to assume here that scholastic marks 
are seen by the student as being derived from 
some kind of external nonobjective standards 
and, hence, that they constitute a form of 
interpersonal evaluation of him by his teach- 
ers. That is, a high scholastic grade is con- 
sidered here to be one form of approval or 
social reinforcement. 


Method 


A sample of 79 adolescent boys, constitut- 
ing the entire male population of the junior 
and senior classes in a small community pub- 
lic high school, was tested with the Situational 
Interpretation Test (McDavid & Schroder, 
1957; Schroder & Hunt, 1957). This instru-
ment has been standardized upon a sample of
750 adolescent males, and has been validated
behaviorally in an experimental situation 
(Schroder & Hunt, 1957). Thus the SIT has 
been demonstrated to be a satisfactory method 
for assessing the effective value of positive (ap- 
proval) and negative (disapproval) interper- 
sonal events as reinforcers for an individual. 
The test includes eight positive event items, 
in which praise in an undertaken task is de- 
scribed verbally. S is instructed to select one 
of two alternative “interpretations” of this 
event, one describing assimilation of the ap- 
proval event (increased level of self-evalua- 
tion) and the other describing nonassimilation 
of the event (unchanged or decreased self- 
evaluation). For the purposes of this study, 
selection of the former (self-positive) inter- 
pretation was scored 1, and choice of the 
latter (self-negative) interpretation was scored 
0. Thus, scores for social approval assimila- 
tion ranging from 0 to 8 were derived, the 
higher scores indicating relatively greater ef- 
fective value of interpersonal approval as a 
reinforcer. Thus, according to the initial hy- 
pothesis, scholastic overachievers should score 
higher on the SIT approval scale than schol- 
astic underachievers. 

The characterization of individual students 
as either over- or underachievers may be exe- 
cuted with varying degrees of fineness. In 
view of the ill-defined nature of scholastic 
grade marks, it would hardly seem requisite 
to make use of refined statistical procedures 
in such a categorization. Grade assignment is 
the consequence of a partially subjective 
evaluation process on the part of individual 
teachers, based to some extent on their own 
personal standards for student performance 
and to some extent upon the comparison of 
the individual’s performance relative to other 
students. Such scores as scholastic grade-point 
averages, then, are lacking in precision. 
Hence, a crude over- and underachievement 
index was derived from the school’s official 
records on each S. Intelligence was measured 
by Otis Self-administering Tests of Mental 
Ability (Otis, 1922) given to each student 
in the ninth grade, and scholastic grade 
achievement was derived from grade point 
averages calculated through the last com-
pleted school year. Consequently, the intelli-
gence test scores were approximately 18
months old for the juniors and 30 months 
old for the seniors. Inasmuch as intelligence 
measures are assumed to be reasonably stable 
over time for a population of Ss such as the 
sample employed here, and in view of the 
crudity of the indices derived from these 
scores, this decision to utilize recorded scores 
rather than to make a special experimental 
assessment of intelligence seems justified. 
Likewise, the scholastic grade averages were 
derived from two years of school work for 
the juniors and three years of work for the 
seniors. Because of these differences, over- 
and underachievement categorization was exe- 
cuted independently for the two classes. 

The actual procedure for categorizing a 
group of overachievers and a group of under- 
achievers involved the use of decile rankings 
of scores. For each class, Otis intelligence test 
scores as well as scholastic grade averages 
were ranked from lowest to highest. These 
distributions were then divided into deciles 
numbered arbitrarily from 1 (lowest) to 10 
(highest). The individual’s decile standing on 
intelligence was subtracted from his decile 
standing on scholastic average, yielding a 
decile difference index of scholastic over- and
underachievement. A decile difference of two 
or more was arbitrarily selected as the cutting 
point for categorizing the two groups. Thus, 
Ss with positive decile differences of +2 or
more were classified as scholastic overachiev-
ers, and Ss with negative decile differences of
-2 or lower were categorized as scholastic
underachievers. Such classifications for each
class of students were then repooled, provid-
ing for the inclusion of 23 Ss in the over-
achiever group and 21 Ss in the underachiever
group.
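
The decile-difference procedure described above can be sketched in Python. This is an illustration only: the function names, the handling of tied scores (ties keep their order of appearance), and the sample scores in the usage lines are assumptions and do not come from the study.

```python
def decile_ranks(scores):
    """Rank scores from lowest to highest and assign deciles 1 (lowest) to 10."""
    n = len(scores)
    # Indices of the subjects, ordered from lowest to highest score.
    order = sorted(range(n), key=lambda i: scores[i])
    deciles = [0] * n
    for position, i in enumerate(order):
        # Split the ranked list into ten nearly equal blocks.
        deciles[i] = position * 10 // n + 1
    return deciles


def classify(intelligence_scores, grade_averages, cutoff=2):
    """Return (decile difference, label) for each subject in one class."""
    iq_deciles = decile_ranks(intelligence_scores)
    grade_deciles = decile_ranks(grade_averages)
    results = []
    for iq_d, grade_d in zip(iq_deciles, grade_deciles):
        diff = grade_d - iq_d  # scholastic decile minus intelligence decile
        if diff >= cutoff:
            label = "over"
        elif diff <= -cutoff:
            label = "under"
        else:
            label = "neither"
        results.append((diff, label))
    return results


# Hypothetical class of ten students (not data from the study).
iq = [90, 95, 100, 105, 110, 115, 120, 125, 130, 135]
grades = [2.0, 3.5, 2.2, 2.4, 2.6, 2.8, 3.0, 1.5, 3.2, 3.4]
for diff, label in classify(iq, grades):
    print(diff, label)
```

As in the text, the classification would be run separately for each class and the resulting groups pooled afterward.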


Results 


Scores on the SIT approval event scale 
ranged from 0 to 8, and the mean score for 
the entire sample of 79 Ss was 4.70. The 
mean SIT score for the overachiever group 
was 5.78, and the mean score for the under- 
achiever group was 3.62, a difference of 2.16, 
which is statistically significant by simple 
t test at less than the .001 level of confidence 
(t = 10.48). 





An additional exploration of the relation- 
ship between the reinforcement value of social 
approval and the magnitude of scholastic 
over- and underachievement was executed by 
correlating SIT approval scale scores with 
scores representing the magnitude of the decile 
differences derived previously (which ranged 
from -6 to +5). Although the product-
moment correlation coefficient between the
two sets of scores for the entire sample of
79 Ss was .46 (p less than .01), the relation-
ship is by no means linear. Nevertheless,
scholastic over- and underachievement does
appear to be related to the effectiveness of
social approval as a reinforcer for the indi-
vidual student.


Discussion 


These findings indicate that deviations of 
academic achievement from the level pre- 
dicted by intelligence testing may at least in 
part be accounted for in terms of the effective 
value of approval as a social reinforcer for 
the student. The more effective social approval 
is as a reward, the greater its incentive or 
motivational value, and consequently the 
more likely is the student to work toward a 
goal of high scholastic marks. 

There are undoubtedly a wide variety of 
factors which operate to determine the setting 
of an individual’s goals for achievement. For 
example, the relationship between assimilation
of social reinforcement and scholastic achieve-
ment might be expected to be relatively
stronger when the measurement of achieve-
ment itself is based on such a variable as
grade marks (subjective interpersonal evalua-
tions) than when that measurement is based
on scores from an independent standard-
ized achievement test (objective impersonal
evaluation). Even at their maximum objec-
tivity, school grade marks are in large part
a function of the teachers’ own standards for 
the students’ performance. These standards 
are ordinarily set high, so that they tend to 
exert a unidirectional pull upward in terms 
of grade achievement. It seems logical to sus- 
pect that the individual for whom social ap- 
proval has less effective value as a reinforcer 
would tend to operate in terms of his own
personal standards for achievement and set
his goals or aspirations according to some- 
thing other than the achievement of a high 
mark in school. On the other hand, however, 
the student for whom social approval consti- 
tutes a highly effective reinforcer would be 
expected to utilize such external standards as 
the teacher’s in setting his own goals, a fact 
which would be reflected in strong motivation 
toward scholastic overachievement. 

It seems wise to draw attention to an al-
ternative hypothesis which might account for
the relationships observed here. There is with-
out doubt a great subjective element in the
teacher’s assignment of grades to students.
The notorious halo effect which characterizes
subjective judgments of this sort could con- 
ceivably account for a relationship between 
scholastic grades and responsiveness to social 
approval. The student who is highly respon- 
sive to social approval and disapproval would 
be quite docile and therefore more likely to 
be highly esteemed by the teacher. Such a 
student’s scholastic marks might thus be
favored in the process of grade assignment 
by teachers. 


Summary 


By means of a crude index of scholastic
over- and underachievement, in terms of dis-
crepancies in grade marks from the prediction
of intelligence test scores, scholastic grade
achievement was shown to be related to indi-
vidual differences in the effectiveness of social
approval as a reinforcer as gauged by a
test of subjective interpretations of approval
events. The problem of the meaning of aca-
demic grade marks was explored, and certain
speculations about their role as externally set
standards for achievement were outlined.

Although the data reported here lack pre-
cision in the measurement of absolute achieve-
ment as well as in determination of the exact
relationship between scholastic achievement
and intelligence, they are nevertheless appro-
priate for demonstration of the hypothesis
that the motivational differences assumed to
account for scholastic over- and underachieve-
ment may very likely be partially due to
differences in the incentive or motivational
reinforcement value of positive evaluation by
teachers for various individuals.


MMPI PROFILES OF EPILEPTICS 1


The Minnesota Multiphasic Personality In-
ventory (MMPI) has found wide application 
in psychological evaluation, but has not been 
extensively studied in patients with epilepsy. 
Because of the need for additional aids for the 
differential diagnosis of epilepsy, a series of 
investigations has been undertaken to de- 
termine if epileptics show characteristic re- 
sponses to psychological tests. In a previous 
study (Hovey & Kooi, 1955), certain kinds of 
responses to intelligence test items suggesting 
momentary confusion were found to occur to 
a significantly greater extent among epileptics 
than in either nonepileptic brain-damaged or 
psychiatric patients. In a more recent study 
(Kooi & Hovey, 1957), selected subtests of 
the Wechsler (1946, 1955) scales were ad- 
ministered while electroencephalographic re- 
cordings (EEGs) were being made. Transient 
disturbances of higher integrative processes 
were significantly associated with paroxysmal 
abnormalities of brain rhythms. 

The current study was designed to appraise 
the MMPI as a possible differentiator for epi-
lepsy. The study was carried out in two
phases, the first exploring the efficacy of a 
scoring key technique, the second assaying 
the differential value of profile analysis. 


Development of a Scoring Key 


Two groups of subjects (Ss), 25 in each, 
were used initially. One is referred to as the 
original epileptic group and the other as the 
“spells” group. The size of these groups was 
limited by the rigid criteria established for 
selection of Ss for both groups: a discharge 
diagnosis was required; a valid and reliably
filled out MMPI answer sheet obtained dur-
ing hospitalization must have been available;
an electroencephalographic examination must
have been done during the early part of hos-
pitalization primarily because of complaints
of “spells” such as blackouts, transitory losses
of consciousness, jerking episodes, or other-
wise for suspicion of epilepsy. The Ss in the
original epileptic group each had received not
only a diagnosis including clinical epileptic
manifestations, but also an EEG with par-
oxysmal formations, either spikes or spike-
and-waves. Clinical epilepsy was not con-
firmed in the “spells” Ss, and they all had
EEG records that were within normal limits.

1 Calvin W. Taylor, Professor of Psychology at the
University of Utah, critically evaluated the statisti-
cal procedures and provided constructive suggestions.

2 Also Neurology Section, Department of Psychi-
atry, University of Utah College of Medicine.

Analysis of all 566 MMPI items was made
to determine which might differentiate be-
tween the two groups at or above the .05 level
of significance according to the χ² test. A to-
tal of 37 potentially significant items emerged,
from which scoring stencils were made. All 50 
Ss were scored with these stencils. New groups 
of Ss were also scored, with the stencils based 
on the original 50 cases. 
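
The item-selection step can be illustrated with a short Python sketch. It assumes a 2 x 2 Pearson chi-square without continuity correction for each item (the text does not say which form was used); the function names and the data layout are hypothetical.

```python
CRITICAL_05_DF1 = 3.841  # chi-square critical value for df = 1 at the .05 level


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table
        group 1: a answered True, b answered False
        group 2: c answered True, d answered False
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:  # an empty row or column carries no information
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


def significant_items(epileptic_answers, spells_answers):
    """Each argument maps an item number to a list of True/False answers."""
    selected = []
    for item in sorted(epileptic_answers):
        e = epileptic_answers[item]
        s = spells_answers[item]
        a, c = sum(e), sum(s)              # number answering True in each group
        b, d = len(e) - a, len(s) - c      # number answering False in each group
        if chi_square_2x2(a, b, c, d) >= CRITICAL_05_DF1:
            selected.append(item)
    return selected


# One hypothetical item: 20 of 25 epileptics vs. 10 of 25 "spells" Ss answer True.
print(chi_square_2x2(20, 5, 10, 15))
```

When 566 items are each tested at the .05 level, roughly 28 (566 x .05) would be expected to pass by chance alone, so a key built this way needs checking on new cases, as was done here.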

One consisted of 33 additional epileptic
cases, one of 50 psychoneurotic cases, one
of 50 psychotic cases, and one of 50 per-
sonality-problems cases (such as “passive-ag-
gressive personality,” “behavior disorder” or
“sociopathic personality”). The latter three
groups, when combined, are referred to as
the psychiatric group. The new group of 33
epileptics consisted of all the available cases
in the discharge files with MMPIs. The three
psychiatric groups were selected on a con-
tinual alphabetical basis, beginning with A,
until the fiftieth case for each group was
reached. Established clinical diagnoses, suc-
cessful execution of the MMPI, and release
from a period of hospitalization during which
these were done were the only criteria for the
new groups.


Results from Use of a Scoring Key


The two original groups of 25 each were
differentiated with the scoring stencils be-
yond the .001 level of significance by appli-
cation of either χ² or r_bis. However, when the
procedure was applied to the new groups con-
sisting of a total of 183 Ss, the shrinkage in
significance was marked. The stencils differ-
entiated the new groups of epileptics from
the total psychiatric group just beyond the
.01 level according to r_bis. The arrays of
scores were such as to indicate that the sten-
cils would be of very limited use clinically.

Readers might well be interested in the 
kinds of items which appear to differentiate 
best. For this reason, the 10 most significant 
ones are presented here. They appear in rank 
order according to significance (the first be- 
ing most significant), with T for true and F 
for false indicating in which direction the epi- 
leptics responded more frequently than did 
the group of controls: 


I seldom worry about my health (36-F); I have
never had a fit or convulsion (154-F); I sometimes
find it hard to stick up for my rights because I am
so reserved (509-T); I feel anxiety about something
or someone almost all of the time (337-T); I believe
in the second coming of Christ (98-F); My worries
seem to disappear when I get into a crowd of lively
friends (451-T); My eyesight is as good as it has
been for years (274-F); I am in just as good physical
health as most of my friends (51-F); No one cares
much what happens to you (252-F); I have never
seen a vision (464-F).


Assay of a Profile Analysis 


Since MMPI profile shapes generally have
turned out to be superior to single scale scores
for aids to diagnosis, the problem was reap-
proached using profile analysis. One section
(pp. 290-339) of “Basic Readings on the
MMPI” (Welsh & Dahlstrom, 1956) is de-
voted to profile analysis with the following
articles: Profile analysis of the MMPI in dif-
ferential diagnosis, P. E. Meehl; An anxiety
index and an internalization ratio for the
MMPI, G. S. Welsh; A technique for objec-
tive configural analysis of MMPI profiles,
P. L. Sullivan and G. S. Welsh; MMPI pro-
files and personality characteristics, H. B.
Hovey; The F minus K dissimulation index
for the MMPI, H. G. Gough; Personality
characteristics related to success in practice
teaching, H. G. Gough and W. H. Pember-
ton; and The validity of MMPI interpreta-
tions, K. B. Little and E. S. Shneidman.

When the composite profile for the original 
epileptic group in the present study was com- 
pared by inspection with that for the “spells” 
group, differences between these profiles pre- 
sented possibilities for differentiation. A con- 
figural analysis similar to that used by Sul- 
livan and Welsh, referred to above, seemed 
in order. 

By inspection of the two composite profiles
there were about 50 points of apparent dif-
ference between pairs and triplets of scores.
These were evaluated by applying the χ² test,
and 14 of them each differentiated the two
groups beyond the .05 level. Some of the sig-
nificant differences are referred to as Vs in
the profile. A V was considered as present
when any scale score was more than one full
raw-score point below a straight line from a
score to the left and to the right. Other
differences are referred to as “higher-than,”
when a T score on a scale was more than one
full raw-score point above a comparison scale
score. Both kinds of determinations can usu-
ally be made from inspection, use of a rule
only occasionally being necessary. The 14
points of difference or signs in the direction
of epilepsy were then looked for in the pro-
files for each of the 50 original Ss, each pro-
file being assigned a total rating of from 0 to
14 across these 14 profile features. The two
groups were compared for these ratings. Then
each S of the new groups was given a rating
based on the same kind of profile analysis,
and comparisons were made between these
new groups.


Results from Profile Analysis 

The 14 significant profile signs were evalu- 
ated statistically in the cross-validating sam- 
ples of 33 epileptics vs. 150 psychiatric cases; 
all 14 tests yielded significant chi squares, 
ranging from a p value of greater than .001 
for the most significant, to a p value of .05 
for the least. The 14 signs are listed below in 
order of significance as shown in this cross- 
validating check. None of the MMPI validat- 
ing scales nor the Si scale entered into any 
combinations reaching significance. 





 1. V for D - Pd - Ma               8. V for Hs - Pd - Mf
 2. V for Mf - Pa - Ma              9. V for Hs - Pd - Pt
 3. Ma higher than Pa              10. V for Hy - Pd - Ma
 4. Hs higher than Pd              11. Hy higher than Pd
 5. Ma higher than Pd              12. Pt higher than Sc
 6. Hs higher than Pa              13. V for Mf - Pa - Pt
 7. Pa one of two lowest scores    14. V for Hy - Pd - Mf
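
The two kinds of profile sign defined earlier (“V” and “higher-than”) can be sketched in Python as follows. Several points are assumptions made for illustration: the straight line for a V is interpolated by each scale's position in the usual left-to-right profile order, “one of two lowest” is taken to mean that fewer than two scales fall strictly below it, only five of the signs are included, and the example profile is invented.

```python
# Clinical scales in their usual left-to-right order on the profile sheet
# (validity scales and Si are omitted, since none entered a significant sign).
SCALE_ORDER = ["Hs", "D", "Hy", "Pd", "Mf", "Pa", "Pt", "Sc", "Ma"]
POSITION = {name: i for i, name in enumerate(SCALE_ORDER)}


def has_v(profile, left, middle, right, margin=1.0):
    """True if `middle` lies more than `margin` below the line from left to right."""
    x0, x1, x2 = POSITION[left], POSITION[middle], POSITION[right]
    y0, y2 = profile[left], profile[right]
    line_at_middle = y0 + (y2 - y0) * (x1 - x0) / (x2 - x0)
    return profile[middle] < line_at_middle - margin


def higher_than(profile, scale, comparison, margin=1.0):
    """True if `scale` is more than `margin` above `comparison`."""
    return profile[scale] > profile[comparison] + margin


def among_two_lowest(profile, scale):
    """True if fewer than two scales are strictly lower than `scale`."""
    lower = sum(1 for name in SCALE_ORDER if profile[name] < profile[scale])
    return lower < 2


def rating(profile):
    """Count how many of the (illustrative subset of) signs are present."""
    signs = [
        has_v(profile, "D", "Pd", "Ma"),
        higher_than(profile, "Hs", "Pd"),
        higher_than(profile, "Pt", "Sc"),
        higher_than(profile, "Ma", "Pa"),
        among_two_lowest(profile, "Pa"),
    ]
    return sum(signs)


# Invented profile, for illustration only.
example = {"Hs": 70, "D": 72, "Hy": 65, "Pd": 50, "Mf": 52,
           "Pa": 48, "Pt": 68, "Sc": 60, "Ma": 58}
print(rating(example))
```

A full rating in the sense of the text would count all 14 signs, giving each profile a score from 0 to 14.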


Since there appears to be no formula for
expressing true relationships for the various
groups, and since the true relationships would
probably be somewhere between biserial r
and point-biserial r, both coefficients are pre-
sented. The r_bis between the 14 point profile
ratings and the original epileptic vs. “spells”
group was .61, with a standard error of .18
and a z of 3.37 (r_pbi = .48). That for the new
group of epileptics vs. the three new psychi-
atric groups combined was equal to .68 with
a standard error of .06, and a z of 6.342 (r_pbi
= .49). The r_bis values for the new epileptic
group vs. the neurotic, psychotic, and person-
ality-problems groups were respectively: .58,
se = .14, z = 4.18 (r_pbi = .46); .69, se = .06,
z = 4.96 (r_pbi = .54); .71, se = .06, z = 5.46
(r_pbi = .56); each being beyond the .001 level
of significance. The profile ratings discrimi-
nated the epileptics from the neurotics to a
lesser extent than from the psychotic group,
from the personality-problems group, and
from the combined psychiatric groups. None-
theless, the profile ratings separated epileptics
from neurotics to a distinctly greater extent
than did the originally developed item key
separate the epileptics from either the com-
bined psychiatric group or from any of the
subgroups.
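
For readers unfamiliar with the two coefficients, a minimal Python sketch of the standard textbook formulas follows. The function names are hypothetical, and nothing here is taken from the authors' own computations; the biserial coefficient additionally assumes that a normally distributed variable underlies the two-group split.

```python
from math import sqrt
from statistics import NormalDist, fmean, pstdev


def point_biserial(ratings, in_group):
    """Point-biserial r between a rating and a True/False group membership."""
    group1 = [r for r, g in zip(ratings, in_group) if g]
    group0 = [r for r, g in zip(ratings, in_group) if not g]
    p = len(group1) / len(ratings)  # proportion of cases in the True group
    q = 1 - p
    # Mean difference in units of the overall (population) standard deviation.
    return (fmean(group1) - fmean(group0)) / pstdev(ratings) * sqrt(p * q)


def biserial_from_point_biserial(r_pb, p):
    """Convert a point-biserial r to a biserial r for a split with proportion p."""
    # y is the height of the standard normal curve at the point cutting off p.
    y = NormalDist().pdf(NormalDist().inv_cdf(p))
    return r_pb * sqrt(p * (1 - p)) / y


# Two equal groups, as in the original 25 vs. 25 comparison: p = .5.
print(round(biserial_from_point_biserial(0.48, 0.5), 2))
```

Because the biserial value is always the larger of the two, reporting both coefficients brackets the relationship in the way the paragraph above describes.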

Table 1 presents cumulative percentages for
the various groupings surpassing different 
cutting points. As cutting points on the rating 
scale were raised to higher levels, higher pro- 
portions of the new epileptics were identified 
in relation to the three new psychiatric groups. 
If the midpoint for the new group of epilep- 
tics is used identifying one-half of the epilep- 
tics, only 9% of the psychiatric cases show 
up as false positives. Among this 9% five per- 
sons had had EEGs. Three of these EEGs 
were at least suggestively paroxysmal, and two 
of those persons had been observed by trained 
professional personnel in clinical seizures. 
These cases were not included in the epilep- 
tic group since the seizure phenomena had 
not been listed among the final diagnoses. 

Perhaps of interest was a special compari- 
son of the profile ratings of the original epi- 
leptic group vs. a supplemental new group 
drawn from the total psychiatric population 
of discharges. There were 71 psychiatric cases 
who had satisfactorily completed the MMPI 
and whose EEGs were within the normal 
range. When this special new group of 71
psychiatric cases was compared with the origi-
nal epileptic group for profile ratings, the re-
lationship between such ratings and epilepsy
was r_bis = .66, se = .07, z = 4.71 (r_pbi = .48).
This is approximately the same as that for
the original epileptic group vs. the original
“spells” group.

Of additional interest was the relationship
between profile ratings for patients with par-
oxysmal EEGs vs. those with nonparoxysmal
EEGs irrespective of clinical diagnosis. Aside
from the original 50 criterion cases, there were
188 cases in the discharge files with EEG and
reliable MMPI records. The r_bis for this new
group of 188 was .54, se = .06 (r_pbi = .42).
When the clinically determined brain-impair-
ment cases were eliminated from this sample,
the r_bis = .50, se = .09 (r_pbi = .35). When
the clinically diagnosed epileptics were re-
moved, the r_bis = .45, se = .11 (r_pbi = .35).
These figures surpass the .001 level.


Discussion 


For differentiating epileptics from psychi- 
atric patients, it appears that profile configu- 
ration of the MMPI is more sensitive than a 
scoring system based on item analysis. At any 
rate, the two experiments reported above in- 
dicate this, the one using item analysis yield- 
ing a differentiation on new epileptic and psy- 
chiatric groups of just beyond the .01 level 
of significance, and not attaining the .01 level 
when the epileptic group was separately com- 
pared with each of the three psychiatric 
groups, yet the profile rating system sur- 
passed the .001 level for each of the three 
comparisons in the cross-validating groups. 
The scoring key based on item analysis is 
considered too crude a differentiator for clini- 
cal use, whereas ratings of individual cases 
based on profile configuration give some 
promise. 

It should be emphasized that the profile 
rating cannot be used as an independent in- 
dicator of epilepsy. Table 1, which shows per- 
centages of cases at various cutting points in 
several diagnostic categories, illustrates the 
point. If a cutting point at 12 were used, only 
18% of the new epileptics would be picked 
up, but only 1% of the new psychiatric cases 
would be misidentified. A cutting point at 9 
would identify half of the epileptics, but 
would also pick up 9% false positives. If one 
at 4 were used, 90% of the epileptics would 
be identified but also over half of the psy- 
chiatric cases would be misidentified. Thus in 
a clinical setting the profile rating should be
used in conjunction with historical data, neu- 
rological signs, and evidence from additional 
neurological procedures including the EEG in 
the process of diagnostic formulation. The 
highest scores would receive the greatest 
weight because of the reduced possibility that 
these would represent false positives. In cer-
tain cases a high or low score may result in
a clinical re-evaluation with a subsequent 
change in diagnosis. 
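
The trade-off between cutting points discussed above can be tabulated with a few lines of Python. The sketch assumes that “a cutting point at 9” means a rating of 9 or higher; the function names and the ratings in the usage lines are invented for illustration and are not the values behind Table 1.

```python
def percent_at_or_above(ratings, cut):
    """Percentage of cases whose profile rating is at or above the cutting point."""
    return 100.0 * sum(1 for r in ratings if r >= cut) / len(ratings)


def cutting_point_table(epileptic_ratings, psychiatric_ratings, cuts=range(0, 15)):
    """For each cut: (cut, % epileptics identified, % psychiatric false positives)."""
    return [
        (cut,
         percent_at_or_above(epileptic_ratings, cut),
         percent_at_or_above(psychiatric_ratings, cut))
        for cut in cuts
    ]


# Invented ratings on the 0-14 scale.
epileptics = [12, 9, 9, 4, 3, 10]
psychiatric = [2, 3, 9, 1, 0, 4, 5, 2, 1, 0]
for cut, identified, false_pos in cutting_point_table(epileptics, psychiatric, [4, 9, 12]):
    print(cut, round(identified, 1), round(false_pos, 1))
```

Raising the cut lowers both columns together, which is why the text recommends giving the highest ratings the greatest weight rather than relying on any single cutting point.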

MMPI profile configurations for epileptic 
groups have been reported by Modlin (1947) 
and Grayson (1951). Modlin emphasizes the 
emotional maladjustment associated with epilepsy,
appraising 87% of his group of 22 epilep- 
tics as being so maladjusted, the majority of 
these expressing anxiety or depression pre- 
dominantly in the profiles. The composite pro- 
file for our total group of 58 epileptics agrees 
well with Modlin’s appraisals, and has a con- 
figuration suggesting chronic anxiety. In fact 
it is sufficiently similar to the composite pro- 
file produced by a group of 60 anxiety-reac- 
tion cases previously reported by one of the 
authors (Hovey, 1949) so that it could be in- 
terpreted as an anxiety-profile. (The anxiety 
group in that report produced the following 
coded (Welsh, 1948) profile 21”783' 4695 
F-L/K. That for the epileptic group was 
21'73894-56FL/K. That for the combined 
psychiatric groups was 24'87136-95F-K/L.) 
The present study indicated, however, that 
the cross-validation epileptics were differenti- 
ated from cross-validation heterogeneous neu- 
rotic Ss at the .001 level of confidence. The 
significance of the role of anxiety in relation 
to the anxiety-like profiles in patients with con- 
vulsive disorders needs further investigation. 

Grayson reports that his epileptics most 
frequently showed a high neurotic type curve 
with elevated neurotic triad, but with a sec- 
ondary peak at Sc. Our composite epileptic 
group differed from this in having a second- 
ary elevation at Pt. He found the lowest point 
to usually be at Mf. This scale produced next 
to the lowest point for our group, but it was 
higher than for our composite control groups. 
He also found Pa and Ma to be low. The low- 
est for our group was at Pa, and Ma was 
among the lower half of the points, but Ma 
was higher than for our control groups. Con- 
sequently, low Mf and Ma seemed less asso- 
ciated with our epileptic than with psychi- 
atric conditions. 


Summary 


An MMPI scoring key was developed using
a group of 25 diagnosed epileptics with par-
oxysmal EEGs and a group of 25 psychiatric
patients who had complained of epileptoid
“spells” but who had normal EEGs and no
confirmation of epilepsy. When the key was
applied to new groups, the shrinkage was
considerable, making it of little, if any, value
for clinical use. On the other hand, profile
analysis done on these same original groups 
when applied to new groups differentiated 
diagnosed epileptics and also all available 
cases with paroxysmal EEGs from the vari- 
ous control groups beyond the .001 level of 
significance. 

The most outstanding features of the pro- 
files of epileptics were low Pd and Pa in re- 
lation to other scores. 


THE THEORY AND PRACTICE OF “THERAPEUTIC
INTERVENTION” IN CHILD TREATMENT 1


The literature reveals basic differences of
opinion about the implication of permissive-
ness and the application of limits in child
treatment. Some leading therapists (Rosen-
thal, 1956; Schiffer, 1952; Slavson, 1943)
consider unconditional permissiveness and un- 
restrained acting out the primary requisites 
for effective child therapy. They strongly ob- 
ject to the imposition of prescribed limits by 
the therapist, and see in it a dangerous tech- 
nique that undermines the very foundation of 
the therapeutic relationship. They maintain 
that no predetermined set of limits can ever 
be applied in psychotherapy because “thera- 
peutic intervention” must always be based 
on insight into the needs of the individual 
child and must vary accordingly. 

Other leading therapists feel that “the role 
of limits may be more important than our 
current interest in them would indicate” 
(Bixler, 1949, p. 1), that “limitations .. . 
are set up as a prerequisite to satisfactory 
therapy” (Axline, 1947, p. 131), and that 
“without limits there would be no therapy” 
(Moustakas, 1953, p. 15). 

The difference between the two schools of 
thought is succinctly summarized in their defi- 
nitions of permissiveness. According to the 
one approach, “Permissiveness means the ac- 
ceptance of all behavior as it appears in the 
(therapy) group, be it aggressive, hostile, de- 
structive, sadistic, masochistic, etc., without 
reproof, censure or restriction on the part of 
the therapist” (Schiffer, 1952, p. 256). To be 
sure, the therapist does not sanction such be- 
havior; he only permits it. 

According to the other approach, permis-
siveness means the acceptance of all sym-
bolic behavior as it appears in therapy, be it
destructive or constructive, without censure or
restriction. All feelings, fantasies, thoughts,
wishes, passions, dreams, and desires, regard-
less of their content, are accepted, respected,
and allowed expression through words and
play. Direct acting out of destructive behav-
ior is not permitted; when it occurs, the
therapist intervenes and redirects it into sym-
bolic outlets.

1 This term was coined by S. R. Slavson in a per-
sonal communication.

These formulations of permissiveness touch 
on issues that strike at the very core of one’s 
therapeutic concept and conduct. In the ab- 
sence of objective research evidence, these are 
not issues that one therapist or one school of
therapy can decide for another. At the pres- 
ent stage of knowledge, psychotherapists act- 
ing upon and reporting on different hypothe- 
ses will need to state their orientation, de- 
scribe their practice, glean evidence from their 
experience, and present it for scientific evalua- 
tion. 

This article, written from a child-centered 
viewpoint, aims to provide a rationale for the 
use of limits in play therapy, to discuss vari- 
ous limits conducive to effective therapy, and 
to suggest techniques of limit-setting as well 
as methods of dealing with limit-breaking. 

The following six statements are proposed 
as a rationale for the use of limits in indi- 
vidual and group play therapy. 

1. Limits direct catharsis into symbolic 
channels. One of the aims of therapeutic 
limits is to promote release through sym- 
bolic means. The unfulfillable nature of some 
of the children’s desires makes the setting of 
limits on direct acting out unavoidable. Cer- 
tain acts, such as murder, incest, thievery, 
and vandalism are absolutely forbidden in 
our society. Such acts may not be performed
in therapy either, except in effigy. Symbolic
release enables children to channel even in- 
cestuous and destructive urges into harmless 
outlets and to develop sublimations com- 
patible with social demands and mores. Thus 
a child with oedipal entanglements may un- 
dress, hug, kiss, and make love to a mother 
doll. Obviously he may not act so toward his 
mother or his therapist. By setting limits, the 
therapist helps the child to change his object 
choice while allowing him gratification of sex 
interests through socially acceptable channels 
of playing, painting, modeling, puppetry, and 
discussion. 

A child who is angry with his father can 
stab or shoot a father doll. The aggressive 
child may symbolically destroy his parents, 
teachers, and therapist over and over again 
in his play and games, and learn from his own 
experience that his impulses do not actually 
kill anybody. The neurotic child may dis- 
cover that his inner impulses can be dis- 
charged into the playroom without dooming 
him, thus learning that his desires are not 
fatal and need not be so rigidly inhibited. 

2. Limits enable the therapist to maintain
attitudes of acceptance, empathy, and regard
for the child-client throughout the therapy
contacts. It is reasonable to assume that a
therapist cannot remain emotionally accepting 
and empathetic when the child attacks him, 
pulls his hair, paints his forehead, tears his 
shirt, or breaks his glasses. Such activities 
must be prohibited to prevent the arousal of 
anger and anxiety in the therapist himself. 
The ability of any person to tolerate aggres- 
sive attacks is not unlimited. The invoking of 
limits prevents the therapist from exceeding 
his own capacity for tolerance and enables 
him to remain consistently unperturbed and 
tranquil. To retain his role as ego ideal and 
identification model, the therapist must not 
come too close to the brink of his endurance. 
If the therapist questions his ability to stay 
calm and accepting when the child scatters 
mud all over the playroom, he should limit 
the spilling to the sandbox or to one corner 
of the room. If the therapist cannot tolerate 
the child’s painting of walls, he should limit 
the painting to paper or toys. 

It must be stressed that therapeutic con- 
trols always apply to behavior, never to words.
A therapist may set necessary limits on unde-
sirable behavior, provided that he is permit- 
ting verbal and play outlets for the expression 
of feelings. If the therapist cannot tolerate 
the children’s conversation or finds it neces- 
sary to limit the symbolic content of their 
play, then it is unlikely that therapeutic gains 
will accrue. (This point is discussed more 
fully in another part of the article.) 

3. Limits assure the physical safety of the 
children and the therapist in the playroom. 
Several common sense health and safety limits 
must be set in the course of play therapy. 
Children may not drink polluted water, hang 
out of the window, or set themselves afire. 
They also may not endanger the life or health 
of the therapist: they are not allowed to 
throw sand in his eyes, cough in his face, or 
dent his cortex with a mallet. For his own 
safety, then, the therapist may not be at- 
tacked physically. Other important reasons 
for this prohibition will be discussed later. 

4. Limits strengthen ego controls. Many 
young children present behavior problems 
characterized by an inability to cope with 
socially unacceptable inner impulses. The aim 
of therapy with these children is not the re- 
laxation of superego functions but the tight- 
ening of ego controls. By setting limits and 
invoking prohibitions the therapist becomes 
the external authority figure whose values, it 
is hoped, the child will absorb through identi- 
fication and introjection. Without limits, ther- 
apy may only delay self-regulation, encourage 
narcissism, and lead to a false sense of om- 
nipotence. By encountering limits on some 
actions in an accepting atmosphere, the child 
learns to distinguish between wishes and deeds 
without negative consequences. He learns that 
he may feel all his feelings but may not act as 
he pleases. By accepting the child’s feelings 
and preventing his undesirable acts, the thera- 
pist reduces the child’s guilt and at the same 
time turns his wishes in the direction of re- 
ality controls. Thus the child comes to ac- 
cept and control impulses without excessive 
guilt. 

5. Some limits are set for reasons of law, 
ethics, and social acceptability. Children may 
not sexually play with each other in the play-
room because, among other reasons, it is so-
cially unacceptable and against the law. A
child may not deliberately defecate in the
sandbox or urinate on the floor, because it 
is socially unacceptable. A child may utter 
“smutty” words in the playroom to his heart’s 
content, but he may not yell profanities at 
passers-by or at the secretarial staff, again 
because it is socially unacceptable. 

6. Some limits are set because of budgetary 
considerations. Some limits are set simply be- 
cause of realistic monetary considerations. A 
child may not destroy expensive toys because 
they are expensive and clinics usually have 
limited budgets. Thus Bobo, the costly clown, 
cannot be hit with a sharp instrument; it is 
only “for boxing.” 


Therapeutic Limits and Parental Restrictions 


There is a vast difference between thera- 
peutic limits and parental restrictions. In dis- 
ciplining a child, parents and teachers gen- 
erally focus on stopping undesirable actions, 
not on liquidating the negative feelings mo- 
tivating these actions. The child is usually 
neither helped to bring out his troubled feel- 
ings nor provided with safe channels for ca- 
tharsis. The restrictions are frequently set in 
the midst of angry arguing, and are often 
punitive and inconsistent. More often than 
not the child is left with the sad conclusion 
that not only his deeds but also his feelings 
and wishes are disapproved of. 

Therapeutic limits help the child deal both 
with his feelings and actions. The therapist 
permits all verbal and symbolic expression of 
feelings but limits and redirects undesirable 
acts. The limits are always set in a manner 
that preserves the child’s self-respect. The 
limits are never punitive, arbitrary, or ca- 
pricious. They are treatment-motivated and 
are applied without anger or violence. The 
child is not rejected or shamed for resenting 
the prohibitions. His objections to the limits 
and his wish to break them are recognized 
and respected, and harmless channels for ex- 
pressing his feelings are provided. 

When limits are employed therapeutically, 
they may lead to voluntary acceptance by the 
child of the need to inhibit antisocial wishes.
In this sense limits are conducive to the de- 
velopment of self-discipline; through identifi- 
cation with the therapist and the values he 
personifies, the child achieves greater powers 
of self-regulation and self-command.


The Techniques of Limit-setting


Both in therapy and in life, children need 
a clear definition of acceptable and unaccept-
able behavior. They feel safer when they
know the boundaries of permissible action. 
Therefore, limits should be delineated in a 
manner that leaves no doubt in the child’s 
mind as to what constitutes unacceptable 
conduct in the playroom. It is preferable that 
limits be total rather than conditional. There 
is a clear distinction, for instance, between 
splashing and not splashing water on the 
therapist. However, a limit that states “you 
may splash me as long as you don’t wet me 
too much” is inviting a deluge of trouble. 
Such a vague statement leaves the child with- 
out a clear criterion for making decisions. 

Limits should be stated in a friendly but 
firm manner. Children do not readily accept 
restrictions invoked with a halting and hesi- 
tant mien. When presented clumsily, limits 
may become a challenge to children, evoking 
a battle of wills and focusing therapy on re- 
strictions rather than relationships. 

Limits must be presented in a manner that 
minimizes the arousal of resentment in the 
children. The very process of limit-setting 
should convey a spirit of nonpunitive and 
helpful authority. There are different ways 
of phrasing specific limits. At times the fol- 
lowing four-step sequence may prove helpful: 
(a) The therapist recognizes the child’s feel-
ings or wishes and helps him to express them
as they are. (b) He states clearly the limit
on a specific act. (c) He points out other 
channels through which the feelings or wishes 
can be expressed. (d) He helps the child 
bring out feelings of resentment that are 
bound to arise when restrictions are invoked. 

This approach is illustrated in the follow- 
ing play therapy sequence: Johnny, age nine, 
wanted very much to take a gun home and 
made his desire known. 


Johnny: I’m going to take this gun with me.
Therapist: It’s easy to see, Johnny, that you like
the gun and would like to take it home.
Johnny: Yes, I would. Can I?
Therapist: The rule of the playroom is that all
toys have to stay in here. But you may have the
gun whenever you come to the playroom.
Johnny: I don’t like the rule.
Therapist: You wish there weren’t such a rule.
Johnny: I wish the rule was that you can take all
the toys home.
Therapist: Such a rule you would really like?
Johnny: (Smiling) Yeah, but then you wouldn’t
have a playroom.


It is not always necessary or feasible to 
phrase the limit in the above pattern. At 
times it is more effective to state the limit 
first and reflect feelings later. When a child 
is about to fire a dart gun or throw a block 
at him, the therapist might say (and he had
better speak quickly), “Not at me, at the
toys.” He will do well to point to the toys in
order to distract the child from himself. He 
might then reflect the child’s wish to shoot at 
him and perhaps suggest to the child some 
harmless ways of expressing anger; e.g., “You 
may draw my face on the blackboard and 
shoot at it, or you may write my name on 
Bobo and punch it.” 

Limits should be phrased in a language that 
does not constitute a challenge to a child’s 
self-respect. Limits are heeded better when 
stated succinctly and impersonally. “Time is 
up for today,” is more readily accepted than
“Your time is up and you must leave 
now.” “No shooting at each other” is obeyed 
more willingly than “You must not shoot at 
Johnny.” Whenever possible, limits should be 
stated in the passive rather than the active. 
“Walls are not for painting” is accepted with 
less resistance than “You must not paint the 
walls.” “Toys are not for breaking” is better 
received than “You may not break toys.” 

At times limits may be set nonverbally. 
When a child “plays” the xylophone with a 
hammer, the therapist may hand him drum- 
sticks and take away the hammer. This can 
be accomplished without a word, just with a 
smile. The child may not even be aware that 
a limit has been invoked. The therapist may 
even be thanked for providing the appropriate 
tools. 


Situational Limits 


The materials and toys as well as the physi- 
cal setting of the playroom should be so 
planned that they exert a “limiting” influ- 
ence on children. Undesirable behavior can 
be prevented by removing in advance objects 
used for inappropriate acting out. Sharp or 
pointed toys should be taken out of the play- 
room before a session with aggressive chil- 
dren. Fingerpaint should not be given to over-
active children; it overstimulates them and
invites smearing of each other and the walls. 
Toys should be sturdy and hard to break. 
Windows, lights, and one-way vision glass 
should be protected with wire mesh. Floors 
must be waterproofed and walls readily re- 
paintable. Office desks with stuffed drawers, 
overdecorated chairs, telephones, and personal 
books have no place in a playroom; their 
protection necessitates the setting of too many 
limits and interferes with the therapist’s abil- 
ity to maintain “free-floating attention.” 

It is advisable that the playroom be sound- 
proofed and isolated from clinic offices so 
that play therapy will not interfere with the 
other activities of the agency. 


Limits Conducive to Effective Therapy 


A time limit. A time limit is always neces- 
sary in child therapy. A play therapy session 
usually lasts 50 minutes. The therapist tells the 
child of the time limit and toward the end of 
the session reminds him that he has only a 
few minutes left to play. The therapist will 
say, “There are only five minutes more before 
time is up.” He may also give the child a 
one-minute reminder. At the end of the hour 
the therapist will get up and say, “Time is 
up for today.” With young children he may 
add, “Now we go out.” The therapist should 
adhere to the time limit consistently. He 
should not prolong the session even if the 
child brings out “significant material.” The 
child gains security from the predictability 
of the therapy hour. 

Toys may not be taken out of the play-
room. Sooner or later children want to take
toys home. They may want to borrow, ex- 
change, or buy playroom toys. A limit should 
be set stating that “all the toys must stay in 
the playroom.” Toys may not be taken home 
or to the waiting room or to the bathroom. 
If the child wants to show a specific toy to 
his mother, he may invite her to see it in the 
playroom. However, children are allowed to 
take home any painting or clay sculpture 
made by them during the session. The limit 
on taking toys home pertains also to broken 
toys. They too may not be removed from the 
playroom by the child. The reason is obvious: 
too many toys would be broken for the pur- 
pose of taking them home. What is the ra- 
tionale for not allowing children to take toys
home? Besides obvious budgetary considera-
tions, there is a therapeutic reason for not 
giving toys to patients: the relationship be- 
tween therapist and child should be based on 
emotional, not material sharing. 

Breakage. Children are not permitted to 
break room equipment or expensive toys. 
They may not pierce the rubber clown with 
a sharp instrument. They may not break the 
window or throw sand in the air conditioner. 
Their wish will be recognized but a limit in- 
voked. The therapist may say, for example, 
“You would like to cut Bobo to pieces but 
he is not for breaking. He is for punching,” 
or “The air conditioner is not part of the 
play material. It is part of the equipment of 
the room.” 

Physical attacks upon the therapist. Child 
therapy literature shows general consent about 
the necessity of prohibiting physical attacks 
on the therapist. Both analytic and client- 
centered therapists do not believe that physi- 
cal attacks are helpful either to the therapist 
or to the child. The rationale for this prohibi- 
tion is as follows: (a) It assures the physical 
safety of the therapists. As Dorfman (1951,
p. 258) puts it, “It saves wear and tear on
fragile therapists.” (b) It saves the child
from guilt, anxiety, and the fear of retalia-
tion. (c) It allows the therapist to remain
emotionally accepting of the child.

Some therapists (Slavson, 1952, p. 
allow young children of preschool age to at- 
tack them physically. They interpret to the
children the reason for the attack. Other
therapists modify this limit to state, “You 
may hit me a little, but you can’t really hurt 
me.” 

The writer believes that the limit against 
hitting the therapist should not be modified 
under any circumstances. There can be little 
therapeutic value in permitting a child to 
attack an adult. Effective therapy must be 
based on mutual respect between the child 
and therapist, with the therapist never abdi- 
cating his adult role. Allowing a child to 
dominate the relationship arouses too many 
insoluble problems both for the therapist and 
the child, and it anchors therapy outside the 
world of reality. Telling a child that he may 
“hit but not hurt” the therapist is asking him 
to make a too fine distinction. Such a vague
limit does not contribute to the security of
the child or the peace of mind of the thera- 
pist. The child is irresistibly challenged to 
test out the prohibition and establish the 
“just noticeable difference” between hitting 
playfully and hurting seriously. 

Physical attacks among children. While 
most therapists agree that physical aggression 
toward the therapist must be prohibited, there 
are conflicting theories regarding the value of 
setting limits on physical attacks among chil- 
dren. Despert (1945, p. 223) believes that
children, especially young ones, should be al- 
lowed to fight in the playroom. According to 
her, symbolic acting out through toys pro- 
vides children “only a limited means of re- 
lease.” The prohibition of physical attacks 
may seem to the child as “equivalent to cen- 
sorship—the type of which is often the basis 
of his own problem.” 

Slavson (1943), on the other hand, be-
lieves that children, especially young ones,
need external restraint when they are over-
aggressive. “Unless the children are checked
by someone outside themselves their aggres-
sion gains momentum and increases in in-
tensity” (p. 160).

Some therapists allow aggressive fights but
keep them under control by serving as referee.
Axline (1947) is against this practice because
“It tends to involve the therapist in a role
that calls for assumption of authority and
judgment which might at times appear as par-
tiality to a certain member or members of
the group” (p. 137). Axline believes that “the
ruling out of physical attacks should be one
of the limitations of group therapy” (p. 137).

In the opinion of the writer, there is little
healing benefit in allowing children to attack
each other physically. Besides the obvious
danger of serious injury, such attacks merely
serve to displace aggression from original
sibling to substitute sibling. It is more thera-
peutic to channel aggressive impulses through
symbolic actions against inflated clowns and
family dolls and into rivalrous target-shooting
and other sublimatory competitive games.


When Should Limits be Presented? 


There are conflicting opinions regarding the 
optimal time of introducing limits. Some
therapists believe that limits should be stated
at the onset of treatment because children
may feel betrayed and disappointed when con- 
fronted with limits unexpectedly. That this is 
true of some children can be seen from the 
following play therapy extract. 


When Eleanor, age 13, first encountered a limit 
she became quite upset and voiced her disappoint- 
ment openly. She said, “There isn’t a place in the 
world without restrictions. Even this place has some. 
I thought that here we could do anything we wanted. 
Now I see that even here there is no freedom. I am 
disappointed because I wish there was one place in 
the world without any restrictions at all.” 


Many therapists, including the writer, are 
of the opinion that limits should not be men- 
tioned before the need for them arises. There 
seems to be little advantage in starting ther- 
apy by invoking prohibitions on actions that 
may never occur. There are some disadvan- 
tages to this practice. The listing of limits 
may serve as a challenge to aggressive chil- 
dren and as catharsis-deterrent to submissive 
ones. 

When Tommy, age 8, first entered the playroom, 
he was told by his therapist, “You may play with 
the toys any way you want to but you may not hit 
me or break toys.” Tommy became quite upset and 
he said, “Oh, no, sir, I'd never think of hitting you.”
Tommy hardly touched a toy during the next few
sessions.


When Limits are Broken 


Child-therapy writers, with one or two ex- 
ceptions, do not acknowledge the obvious fact 
that limits are sometimes broken by child pa- 
tients. Few suggestions can be found in the 
literature for dealing with this phase of ther- 
apy. Even writers who emphasize the vital 
role of therapeutic limits omit specific discus- 
sion of what is to be done when a limit is 
broken. 

In dealing with limit-breaking, Axline
stresses the therapist’s need to remain accept- 
ing of the child. Even when a child breaks a 
limit, the therapist should “stay right there 
with her reflection of feelings.” Axline recom- 
mends that the therapist try to prevent the 
breaking of a limit “if she can do so without 
engaging in a physical battle with the child.” 

The question is, what is to be done when 
the child does engage in a physical battle with
the therapist? Some therapists (Bixler, 1949;
Moustakas, 1953) suggest that after an initial 
warning, the therapist should terminate the 
session and put the child out of the playroom. 

This writer objects to expelling a child from 
the playroom regardless of his transgression. 
Besides conveying rejection, ejecting a child 
is a dramatic way of telling him that he can 
defeat adults. This admission of failure on 
the part of the adult is of no benefit to the 
child. It may well prove to him the suspicion 
that he is hopeless and helpless; since he can 
defeat all adults, no one remains to help him. 
No blanket recommendation can be made on 
how to deal with the child’s aggression, since 
the therapeutic reaction will depend on the 
meaning of the child’s specific action. Some 
aggressive children cannot accept the therapist 
and his friendly overtures because they have 
never recognized or accepted any external au-
thority. These children may need an experi-
ence of submission to an adult who is firm, 
just, and strong. 

When Joel, age 9, insisted on throwing a
chair at the therapist in spite of verbalized
limits and reflected feelings, the therapist got
up and said calmly, “I am bigger and stronger
than you.” The boy put down the chair and 
started cursing the therapist, who helped him 
verbalize his choking anger. 

It must be added that this method of limit- 
ing aggression should be applied only in spe- 
cific cases when other means have failed to 
achieve results. However, when a child has a 
false sense of omnipotence, expressed in neu- 
rotic defiance, submission to the authority of 
an adult may be clinically indicated and may 
prove helpful. 

Another method used successfully by some 
therapists is to transfer a very defiant child 
from individual to group therapy. An aggres- 
sive child who insists on attacking the thera- 
pist and on breaking limits may be put in a 
group of older children. Usually, instead of 
continuing his defiance, such a child will seek 
the therapist’s friendship as a protection 
against the actual or anticipated aggression 
of the other children. The older groupmates
frequently are able to convey more directly
and more potently than the therapist that
limits must be observed. The following play
therapy sequence from an article by the au-
thor (Ginott, 1958, p. 416) will serve as an
illustration. 


Nine-year-old George resented bitterly the few 
limitations set in the playroom as part of therapy. 
He claimed to be “superman” and seemed intent on 
destroying the toys, damaging the room, and attack- 
ing the therapist. When George was transferred from 
individual therapy into a group of older boys, he 
tried to continue his aggressive pattern. However, in 
the group, “superman” George met some rival “su- 
permen.” 

When George threw a wooden block at 10-year-old 
David, the boy looked at him with surprise and in a 
very convincing voice, he echoed one of the playroom 
limitations. “This is not for throwing,” he said, “only 
rubber toys are for throwing.” When George deliber- 
ately shot a dart at his face, David became angry. 
He took hold of George, shook him and said, “Look, 
the playroom is for playing, not for hurting. This is 
the law in here.” 

“I’m above the law,” said George, “I’m superman.” 
“Shake,” said David, extending his hand, “I’m super- 
man too.” “I’m super-superman,” answered George. 
“And I’m super, super, superman,” retorted David. 
The boys burst out in a loud laughter. The therapist 
said, “Both of you are supermen and above the 
law?” “No,” answered George, “nobody here is su- 
perman and nobody is above the law.” 


A Different View on Limits 


Dorfman (1951, p. 262) reports that a 
number of client-centered therapists use only
one criterion for therapeutic intervention:
they only limit activities that interfere with
their ability to remain emotionally accepting
of the child. Some of these therapists allow
the child almost complete control over the
therapy situation: the children may paint the 
therapist’s face, take toys home, urinate on 
the floor, leave the playroom at will, miss ses- 
sions, or terminate treatment. 

Again it must be stated that thus far there 
have been no published research studies on 
the comparative effectiveness of different prac-
tices of limit-setting. As therapists with differ-
ent rationales and orientations report the re- 
sult of their experimentations, the more fruit- 
ful treatment technique will become evident. 


Summary 


This article proposes a rationale for the 
use of limits in child-centered play therapy, 
discusses various limits conducive to effective 
therapy, and suggests several techniques of 
limit-setting as well as methods of dealing 
with limit-breaking. 


Received March 27, 1958.


A COMPARISON OF THE EFFECTS OF PHENOBARBITAL,
PROMETHAZINE, CHLORPROMAZINE, AND PLACEBO
UPON MENTAL HOSPITAL PATIENTS 1, 2


MONROE L. LEVIN


Western State Hospital, Hopkinsville, Kentucky 


This paper is a report of a controlled study 
of the effects of phenobarbital, promethazine 
(Halpern, 1948), chlorpromazine, and placebo 
upon the observable behavior of hospitalized 
chronic schizophrenic patients. Informal ob- 
servation of mental hospital patients had in- 
dicated that incidental variables, which neces- 
sarily accompany the administration of medi- 
cation, may be sufficient to cause profound 
changes in patient behavior regardless of the 
supposed effects of specific drugs. It was 
anticipated that similar behavioral changes 
would occur in experimental groups when 
drugs or placebo were administered under 
properly controlled conditions. 

Others have called attention to the impor- 
tance of controlled studies in drug research 
with mental hospital patients. The possible 
effects of variables other than medication 
have been noted elsewhere (Feldman, 1956; 
Feldman, Lacy, Walker, & Garrez, 1956; 
Hall, 1955; Pollack, 1955; Wikler, 1956). 
However, much of the literature on the tran- 
quilizing drugs is concerned with reports of 
clinical and controlled studies in which inci-
dental variables were not adequately con-
trolled.

1 Promethazine was supplied by Wyeth Labora-
tories as Phenergan. In addition, financial support
was also supplied by Wyeth Laboratories. Chlor-
promazine and chlorpromazine placebos were gener-
ously supplied by Smith, Kline, and French Labora-
tories as Thorazine and Thorazine placebos.

2 The author wishes to acknowledge the assistance
of Raymond Norris, George Peabody College, who
served as statistical consultant for the research.
Thanks are also due to Leonard Lund and Roy Ed-
wards, who supervised the medical aspects of the
procedures, Benjamin Fisher, and the nursing staff
of Western State Hospital, Hopkinsville, Kentucky,
for their assistance.

3 Currently at Oneida Consolidated Schools, Oneida,
New York.


Method


Subjects (Ss) of both sexes were diagnosed 
schizophrenics of a variety of subtypes be-
tween the ages of 21 and 50. All Ss were first
admissions to a state mental hospital who had 
been hospitalized for at least two years. None 
of the patients had ever been diagnosed as 
being nonschizophrenic, nor had they received 
a course of treatment with any of the tran- 
quilizing drugs. In no instance had any S 
received insulin or electric therapy more re- 
cently than three months before the begin- 
ning of the research. 

The Wittenborn Psychiatric Rating Scales 
(1955) were the criterion used to evaluate 
each S. Ss were rated by nurses and aides 
trained to use the Wittenborn scales. Ss were 
rated within 24 hours before the onset of the 
experimental procedures, at the end of the 
treatment period, and two months after treat- 
ment was discontinued. Patients in the pro-
methazine, chlorpromazine, and placebo groups
received tablets for four months. Ss in the 
phenobarbital group received the drug for 
three weeks. Dosages were: promethazine, 50 
mgs. qid; phenobarbital, one-half grain qid; 
chlorpromazine, 100 mgs. tid; placebo, tid. 
Placebo and chlorpromazine were identical in 
appearance. 

Routine procedures were established for all 
Ss. Only the author, a medical extern, and the 
supervising physicians knew which drugs Ss 
were receiving and were aware of the design 
of the research. When medical intervention 
was required, the supervising physician pre- 
scribed the procedures to be followed.




Table 1

Analysis of Variance, Within Subjects

Source of Variation     df        Σx²      Mean Square

Within Subjects        212     18,704.00
Measurements             2      3,143.60     1,571.80
Treatment X Time         6        481.05        80.17
Error                  204     15,079.35        73.92

* Significant at .01 level.
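
The arithmetic of Table 1 can be checked with a short sketch. It assumes that the third column of figures holds mean squares (each sum of squares divided by its degrees of freedom) and that the Treatment X Time source carries 6 degrees of freedom, a value inferred here from 212 - 2 - 204 rather than read from the table. The F ratios it computes are illustrative only; they are not printed in the table as it survives, and the variable names are hypothetical.

```python
# Sums of squares and degrees of freedom taken from Table 1.
# The df of 6 for Treatment X Time is an inference (212 - 2 - 204),
# not a value that survives in the printed table.
table = {
    "Measurements": {"ss": 3143.60, "df": 2},
    "Treatment X Time": {"ss": 481.05, "df": 6},
    "Error": {"ss": 15079.35, "df": 204},
}
within_subjects = {"ss": 18704.00, "df": 212}


def mean_square(row):
    """Mean square = sum of squares divided by degrees of freedom."""
    return row["ss"] / row["df"]


mean_squares = {name: mean_square(row) for name, row in table.items()}

# Each effect is tested against the within-subjects error mean square.
f_ratios = {
    name: ms / mean_squares["Error"]
    for name, ms in mean_squares.items()
    if name != "Error"
}

# The component rows should add up to the Within Subjects row.
ss_total = sum(row["ss"] for row in table.values())
df_total = sum(row["df"] for row in table.values())

for name, ms in mean_squares.items():
    print(f"{name:18s} MS = {ms:9.2f}")
for name, f in f_ratios.items():
    print(f"{name:18s} F  = {f:9.2f}")
print("SS check:", round(ss_total, 2), "vs", within_subjects["ss"])
print("df check:", df_total, "vs", within_subjects["df"])
```

Under these assumptions the component sums of squares and degrees of freedom add up to the Within Subjects row, and the computed mean squares agree with the printed ones to within rounding.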


The Ss were assigned to drug groups by a 
medical extern. Attempts were made to keep 
each of the groups of equal size, so that as 
patients were located for the study, they were 
assigned to groups in sequence. Because 
Phenergan, placebo, and chlorpromazine be-
came available on different dates, it was not 
possible to assign Ss in a purely random 
fashion. 

At the beginning of the study there were 
34 Ss in the phenobarbital group, 30 in the 
promethazine group, 39 in the chlorpromazine 
group, and 36 in the placebo group. At the 
end of the study there were 27 Ss in the 
phenobarbital group, 19 in the promethazine 
group, 34 in the chlorpromazine group, and 
26 in the placebo group. 
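
The group sizes just given imply different rates of attrition in the four groups. A minimal sketch of the arithmetic, using only the counts stated above:

```python
# Group sizes at the beginning and end of the study, as stated in the text.
start = {"phenobarbital": 34, "promethazine": 30, "chlorpromazine": 39, "placebo": 36}
end = {"phenobarbital": 27, "promethazine": 19, "chlorpromazine": 34, "placebo": 26}

dropped = {group: start[group] - end[group] for group in start}
retention = {group: end[group] / start[group] for group in start}
total_start, total_end = sum(start.values()), sum(end.values())

for group in start:
    print(f"{group}: dropped {dropped[group]}, retained {retention[group]:.0%}")
print(f"all groups: {total_end} of {total_start} completed")
```

In all, 106 of 139 Ss completed the study. With three ratings per S, 106 completers would give 106 × 2 = 212 within-subjects degrees of freedom, which agrees with Table 1.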


Whenever side effects occurred, patients 
were withdrawn from medication for no more 
than three days. If it was not possible to 
reintroduce medication after three days, Ss 
were dropped from the study. Only one pa- 
tient, who was receiving phenobarbital, was 
dropped from the study because of side ef-
fects (skin reaction). A few patients on
chlorpromazine and one on placebo had mild 
skin reactions during the course of the study. 
Some patients receiving promethazine com- 
plained of slight drowsiness, but it was not 
sufficient to be of concern. 

When patients became disturbed and re- 
quired additional treatment, they were dropped 
from the study. A total of five Ss were dropped 
from the study because of the institution of 
other treatment procedures. The Ss who were 
dropped because of increased disturbance 
were fairly evenly divided among the groups: 
one chlorpromazine S; one phenobarbital S; 
one promethazine S; and two placebo Ss. 

Other Ss who were dropped from the study
were dropped for administrative reasons.
These included such factors as failure to ad-
minister Wittenborns as scheduled and failure
to administer medication at appropriate in-
tervals.

It was predicted in advance that all sub- 
ject groups would show similar degrees of 
improvement. It was also anticipated that the 
improvement which occurred would disappear 
by the end of the two-month follow-up pe- 
riod, with all subject groups demonstrating 
the levels of behavior which existed at the 
time of the prestudy evaluation. 


Results 


Throughout the study Wittenborn scores 
were treated as global measures. The sum of 
cluster scores, rather than individual cluster 
scores, was used as an over-all measure of
level of adjustment. 

Analyses of variance were conducted in or-
der to determine whether the results could be
attributed to error, or interaction between 
extraneous variables and the procedures used. 
The analyses, summarized in Tables 1 and 2, 
showed that the results which were obtained 
were a function of the methods used rather 
than a reflection of other variables. 

Further statistical tests demonstrated that 
there was a significant decrease in symptoms 
in all groups from the prestudy to end of 
treatment periods, and a significant increase 
in symptoms from the end of treatment to the 
follow-up period. It was also found that there 
were significantly fewer symptoms in all 
groups at the end of the follow-up period 
than there had been at the beginning. 


Table 2


Mean Wittenborn Cluster Scores


Column headings: Prestudy Evaluation; End of Treatment; Two-month Follow-up

Legible entries, in source order: 33.74, 31.53, 27.94, 29.38; 27.15, 21.34, 18.32, 24.88





Comparison of Drugs and Placebo 











[Fig. 1 plot: curves for Placebo, Promethazine, and Chlorpromazine across Prestudy, End of Treatment, and Two-Month Follow-Up]


The data clearly indicated that all groups
showed beneficial effects while receiving
placebo or medication. When treatment was
discontinued, every group began to show an
increase in symptomatology. By the end of
the two-month follow-up period, the number
of symptoms was still lower than it had been
at the prestudy period. In
an over-all sense, none of the drugs was su-
perior or inferior to placebo.


Discussion 


The results clearly supported the hypothe- 
sis which had been formulated in advance of 
the study. It is clear that all groups showed 
improvement of similar degrees and the prog- 
ress of every group (see Fig. 1) was strik- 
ingly similar. 

The general findings were clearly not due 
to medication which was administered. It is 
inferred that the changes which occurred in 
each patient group were due to the effects of 
suggestion, changed ward routine, receipt of 
medication by patients, and other variables 
which necessarily accompanied introduction 
of the experimental procedures. 

Fig. 1. Mean values of Wittenborn scores for experimental and placebo groups.

The improvement noted in the chlorpro-
mazine group at the end of treatment, al-
though barely significant in a statistical
sense, is not sufficient to be attributed to the
drug alone. This is particularly true since all
other groups showed changes which were not
greatly different from those of the chlorpro-
mazine group.

At the end of the two-month follow-up pe-
riod, every group was moving rapidly toward
its pretreatment level of adjustment. The di-
rection which each group’s curve took (see
Fig. 1) is such that it seems safe to assume
that follow-up at a later period would have
found every group at its pretreatment level
of adjustment. Therefore it seems justifiable
to conclude that the behavior changes which
occurred in every group were temporary,
tending to dissipate with time.

The implications of this study are clear.
Incidental variables apparently play a major
part in modifying patient behavior when any
medication is administered. In designing drug
research, great care must be taken to control 
incidental variables. Caution must be used in 
interpreting uncontrolled clinical studies in 
which treatment methods are evaluated. 
Replication of this experiment would be of 
value. The method could be used together
with clinical evaluation to appraise drugs
other than those used in this experiment and
to explore the effects of more prolonged treat- 
ment and greater dosages than were used here. 
Addition of a patient group which was treated 
as every other group in this study, but which 
received no medication or placebo, would 
greatly clarify the effects of administration 
of medication upon a chronic hospital popu- 
lation. 


Summary 


This paper is a report of a controlled study 
of the effects of certain drugs and placebo 
upon the behavior of hospitalized, chronic 
schizophrenic patients. The study was de- 
signed so that the effects of incidental vari- 
ables which are assumed to be present in any 
treatment program could also be inferred. It 
was found that every group showed improve- 
ment while on treatment, that improvement 
tended to dissipate with time, and that vari- 
ables other than medication had a profound 
effect upon patient behavior. The implications
of the findings for future drug research were
discussed. 


Received March 27, 1958. 
AUTISTIC DISTORTIONS IN PARENTS OF
RETARDED CHILDREN¹


G. H. ZUK 
St. Christopher's Hospital for Children 


In recent years a host of studies have been 
undertaken to test the theory that needs in- 
fluence perception or cognitive processes. Few 
of these studies have concerned themselves 
with the parent-child relationship, in which it 
is often taken for granted that autistic proc- 
esses—that is, processes by which needs in- 
fluence perception—abound. Perhaps this is 
the reason for the scarcity of controlled, 
quantitative observation and evaluation. Un- 
fortunately, merely assuming that potential 
for autistic distortion exists in the parent- 
child relationship is of little value in under- 
standing why it is used, how, and under what 
conditions. 

Levine, Chein, and Murphy (1942) defined 
autism as it is to be used in the present con- 
text: it is the process which causes a person 
to see what he wishes to see. Counterbalanc- 
ing autism is something to which Levine, 
Chein, and Murphy refer as the “realistic” 
process: that which causes a person to see 
things as they are objectively, regardless of 
personal needs. Their study showed that 
hungry subjects gave increasingly greater 
numbers of food associations to ambiguous 
perceptual stimuli. This they interpreted as 
evidence of an autistic process. But with in- 
creasing hunger, subjects made fewer food as- 
sociations. This was interpreted as evidence 
of the “realistic” process. 

It is of interest to note that Murphy’s
(1947) view that “needs keep ahead of per-
cepts” bears more than just a passing resem-
blance to Freud’s concept (1931, pp. 405-
426) of unconscious wish-fulfillment. Freud
too believed that needs or wishes could affect
perception, especially if they were uncon-
scious. He postulated that unconscious wishes
often find outward expression by associating
themselves with relatively “harmless” con-
scious percepts. He suggested, furthermore,
that wishes could distort the percepts because
of their relatively great cathectic force. Al-
though this was a normal function of the
psychic system, Freud also explained some
of the psychopathological hallucinatory psy-
choses and hunger fantasies by it.

1 This research was carried out in the Mental Re-
tardation Clinic, which is supported by grants from
the Children’s Bureau, Department of Health, Edu-
cation and Welfare and the Pennsylvania State De-
partment of Health, Division of Maternal and Child
Health.

Background and Rationale 


In the Mental Retardation Clinic² at St.
Christopher’s Hospital in Philadelphia, chil-
dren are seen to determine the presence and
degree of retardation, and to develop a plan
of future management. As part of a psycho-
logical test battery administered, the writer
uses Doll’s (1935b) Vineland Social Maturity
Scale along with a standardized intelligence
test, the Stanford-Binet, L, or Cattell Infant
Intelligence Scale. The Vineland (as it shall
be referred to hereafter) is widely used in
work with retarded children and adults for
the reason that it is not as dependent on
actual participation by the individual as other
intelligence or personality measures. The ob-
servations of an informant can be used by the
examiner to derive the terminal scores (social
age (SA), and social quotient (SQ)). For ob-
vious reasons, the informant in the usual clini-
cal situation with retarded children is the
parent. The parent can answer for the child
when he (or she) is unable to or cannot or
even will not perform for the examiner.

2 The writer wishes to thank other members of the
Clinic staff for efforts in behalf of the study: John
Bartram, John Borriello, June Dobbs, Sidney Alt-
man, and Helen Beck and Doris Haar.

In administering Vinelands and intelligence 
tests to the children at St. Christopher’s, it 
was noticed that marked discrepancies often 
occurred between the terminal scores of both. 
Specifically, SA and SQ were found to be con- 
sistently higher than MA and IQ in all but a 
very small number of cases. This appeared to 
contradict Doll’s (1936) reporting (p. 291) 
that, with the retarded, “the S.A.’s equal the 
M.A.’s (on the average) up to M.A. 4; there- 
after, the S.A.’s average one year higher,” in- 
asmuch as most of the children seen by the 
writer had MAs below 4. 
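
The comparisons of SA with MA and of SQ with IQ rest on ratio quotients: an age-equivalent score divided by chronological age and multiplied by 100. The sketch below illustrates that arithmetic for one child. The ages are hypothetical and are not taken from the study, and the helper name `quotient` is introduced here only for illustration.

```python
def quotient(age_score_months: float, chronological_age_months: float) -> float:
    """Ratio quotient: 100 * (age-equivalent score / chronological age)."""
    if chronological_age_months <= 0:
        raise ValueError("chronological age must be positive")
    return 100.0 * age_score_months / chronological_age_months


# Hypothetical child (not from the study): CA 5 years, MA 2 years 9 months,
# SA 3 years 4 months as reported by an informant.
ca, ma, sa = 60, 33, 40

iq = quotient(ma, ca)   # mental age relative to chronological age
sq = quotient(sa, ca)   # social age relative to chronological age
discrepancy = sq - iq   # positive when the informant's report outruns the test

print(f"IQ = {iq:.0f}, SQ = {sq:.0f}, SQ - IQ = {discrepancy:+.0f}")
```

Because both quotients share the same denominator, a seven-month excess of SA over MA at this age becomes a discrepancy of about 12 quotient points, and the same excess in a younger child would produce a larger one.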

Doll’s early work with institutionalized re- 
tarded showed that, if anything, SQ tended to 
be somewhat lower than IQ, although in nor-
mal subjects SQ and IQ were held to be
roughly equal. In one study, Doll (1935a) re- 
ported a median SQ of 62 as compared with 
a median Binet IQ of 65 for a group of re- 
tarded young adults with a median CA of 20 
years. In a more extensive study of the re- 
tarded (Doll, 1936), he reported a median 
SQ of 45 as compared with a median Binet 
IQ of 56 for a group of young adults with a 
median CA of 19 years. 

On the other hand, with a group of ado- 
lescent and young adult retarded, Schmidt 
(1946) reported findings which were the re-
verse of Doll’s. When her group was tested at
different ages during adolescence, SQ con-
sistently averaged higher than Binet IQ in
each of the comparisons shown.

The apparently contradictory reports of
Doll and Schmidt can be resolved by refer-
ence to the unique feature of the Vineland,
namely, the use of an informant. Doll’s sub-
jects were institutionalized patients; Schmidt’s
were boys and girls attending special classes
in a city school system. Doll used as inform-
ants cottage attendants.³ Schmidt was not in
so relatively controlled a situation. Although
she does not report who her informants were,
it may be reasonably assumed that the par-
ents served in this capacity.

The contradiction, then, may be explained
as a function of the “objectivity” of observa-
tion of the subject’s behavior. This explana-
tion finds considerable support in clinical con-
tact with parents of retarded children. A
common characteristic of these parents is to
not only falsely picture the abilities of their
children but, also, to distort information which
is given to them for the purpose of correct-
ing the false picture. Psychologists using the
Vineland are not completely unaware of this
tendency, and some notation is usually made
to the effect that a parent seems to be over-
estimating his child’s ability or, for that mat-
ter, underestimating.

3 The use of cottage attendants as informants was
confirmed in a personal communication from E. A.
Doll.

This study attempts to make a systematic 
appraisal of the use of different informants on 
the Vineland. The leading hypothesis is that 
the more personally involved the informant, 
the more likely is the subject to be portrayed 
in a favorable light. To test this, the reports 
of both parents and teachers of retarded chil- 
dren were compared. To test for the presence 
of a “realistic” process which, as it were, in- 
hibits the tendency to autistic distortion, com- 
parisons were also made of the Vineland re- 
ports of parents whose retarded children were 
and were not impaired in motor functioning. 
It was the supposition here that motor im- 
pairment reduced the level of ambiguity in 
the parental perception of the child, thereby 
reducing the expression of autistic distortion.


Procedure and Results 


Tables 1 and 2 compare SQ and IQ infor- 
mation on retarded children who had been 
seen independently by two psychologists⁴ at
St. Christopher’s. Psychologist 1 had, in past 
years, examined children who were retarded 
both with and without motor handicap. Chil- 
dren labeled “retarded handicapped” and “re- 
tarded nonhandicapped” were selected by the 
writer from the files of Psychologist 1. The 
procedure followed was to select the first 100 
cases of retarded children (starting up the 
alphabet) who were considered, in the opin- 
ion of the examining psychologist, to have no 
motor handicap which unduly restricted per- 
formance on an intelligence or Vineland test. 
IQs were obtained from either the Merrill-
Palmer or the Stanford-Binet. This retarded
nonhandicapped group (as it shall be here-
after called) consisted of children aged 1
through 8 years, with a median mental age
of 2 years and 9 months. Five of the 100 cases
were subsequently discarded because of com-
plicating clinical signs, such as a suspected
personality disorder.

4 The writer, designated as Psychologist 2 in text
and tables, was one of these. Psychologist 1 was
William Mark of the Cerebral Palsy Clinic at St.
Christopher’s, whose help was most appreciated.


Table 1


Intelligence and Vineland (with Parent as Informant)
Test Scores Obtained by Two Psychologists from
Groups of Handicapped and Nonhandi-
capped Retarded Children


Column headings: Psychologist 1 (Retarded Children, Handicapped; Retarded Children, Nonhandicapped); Psychologist 2 (Retarded Children, Nonhandicapped)

Row labels: IQ (Range, Median); SQ (Range, Median)

Legible entries, in source order: 95; 10-89 a; 50; Range 14-80; 26-100; 64

a A few children with IQs of an order which would classify
them as “dull normal” rather than “retarded” were included
in the study. These children were felt by their examiner to
function at a retarded level.

Entering the same files, 25 cases of re- 
tarded children were selected who, in the 
opinion of the examining psychologist, had a 
motor handicap of sufficient scope to restrict 
the performance and incur a penalty on an 
intelligence or Vineland test. More than half 
of this group had a medical diagnosis of 
cerebral palsy. It shall hereafter be referred 
to in the text and tables as the retarded 
handicapped group. Its age range was from 1 
through 8 years, and the median mental age 
was 3 years and 8 months. Three cases were 
discarded from the group of 25 for their pos- 
sible effect on the homogeneity of the series. 
The writer’s group (Psychologist 2) was com- 
posed of 50 cases which fell into the retarded 
nonhandicapped class. The age range was 
again from 1 through 8, and the median men- 
tal age was 1 year and 10 months. 

Tables 1 and 2 illustrate that SQ was con- 
sistently rated higher than IQ in the case of 
the retarded nonhandicapped children. In the 
case of retarded handicapped children, this 
directional tendency was not found. The trend 
was for SQ to be rated somewhat below IQ.
In both groups, examiners relied on the in-
formation supplied by parents.

* Indicates that SQ is higher than IQ. A minus sign would indicate the reverse, i.e

Unquestionably, it is somewhat easier for
examiners to determine what a child with a
motor handicap cannot do than what he can; 
the motor handicap clearly precludes perform- 
ance, for example, on many of the items of 
the Vineland. On the other hand, it is not as 
easy to determine what a child without an 
observable motor handicap (but retarded) 
cannot do. Was the difference between SQ 
and IQ due to the examiner’s bias which pre- 
disposed him to rate “more accurately” if his 
subject had a penalizing motor handicap? 

In order to control for this bias (and at 
the same time point up the constant bias in- 
troduced by the use of parents as inform- 
ants), the writer had parents and teachers of 
retarded children attending a nursery school⁵
in Philadelphia fill out the Vineland inde- 
pendently. Table 3 reports SQ and IQ infor- 
mation obtained. The writer used as subjects 
those children at the nursery school who fit 
more nearly the category retarded nonhandi- 
capped, although it cannot be said that these 
children were entirely free of penalizing mo- 
tor handicap. The important consideration, 
however, is that parents and teachers were at 
liberty to judge the behavior of the children 
on the Vineland without an examiner’s inter-
pretation of their information.

5 The writer is indebted to Sister Grace Jones, Di-
rector of the Kensington Dispensary in Philadelphia,
for permission to carry out the test procedure with
parents and for the use of additional test material.

In filling out the Vineland independently,
parents and teachers were handed copies of
the scale and instructed to rate the children
as follows:


1—if the child could always perform what the item
asked

¾—if the child could almost always perform the item

½—if the child could perform the item about half of
the time

¼—if the child could perform the item once in a
while

0—if the child could not perform the item


The scoring system was communicated to the 
relatively unsophisticated group with prob- 
ably less confusion than if Doll’s had been 
used. Parents and teachers took frequent ad- 
vantage of the writer’s offer to clarify the 
meaning of items. 

The writer was fortunate to be able to se- 
cure previous test material on the children at 
the nursery school. Fairly recent IQs and 
SQs were already available for most of the 
children. The examiners in these cases were 
from three agencies: St. Christopher’s Hos- 
pital, the public school system in Philadel- 
phia, or a community hospital other than St. 
Christopher’s. Therefore, a number of psy- 
chologists were involved in the evaluations of 
the children. But in each case the informant 
on the Vineland was the parent. 

Table 3

(Table body not recoverable. Legible fragments: “Examination”; “School Children”; 38, 30, 68.)


Table 4


Significance Levels of the Differences Between Intelligence and Vineland (Parents and Teachers as Informants)
Test Scores of Nonhandicapped Retarded Nursery-School Children

(Table body not recoverable. Legible fragments: “Comparisons of Tests”; “Range of”; “Median”; “+8 points”; “.002 (.004)”; “5 points”; “Teacher B”.)


Controlling for examiner bias, Tables 3 and
4 show that parents consistently rated the
abilities of their children higher than “rela-
tively objective observers.” Their SQs were
significantly higher than IQs determined by
psychologists, and higher also than SQs ob-
tained from the two nursery-school teachers
who rated the performance of their children.
SQs obtained from the Vineland ratings of
Teacher A were not significantly different
(Ps of .19 and .38 for a one- and two-tailed
test, respectively) from IQs determined by
psychologists. SQs of parents as informants
were significantly higher than SQs of teachers
as informants.
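
The text does not say which test produced the one- and two-tailed Ps reported above. Purely as an illustration, and assuming a sign test on per-child differences between SQ and IQ, paired tail probabilities of this kind could be obtained as in the sketch below. The differences are hypothetical, and the function `sign_test` is defined here, not taken from the study.

```python
from math import comb


def sign_test(differences):
    """Sign test on paired differences; zero differences (ties) are dropped.

    Returns (plus, minus, one_tailed_p, two_tailed_p), where the one-tailed
    P is the chance of at least `plus` positive signs under a fair coin.
    """
    plus = sum(1 for d in differences if d > 0)
    minus = sum(1 for d in differences if d < 0)
    n = plus + minus
    upper = sum(comb(n, i) for i in range(plus, n + 1)) / 2 ** n
    lower = sum(comb(n, i) for i in range(0, plus + 1)) / 2 ** n
    two_tailed = min(1.0, 2 * min(upper, lower))
    return plus, minus, upper, two_tailed


# Hypothetical SQ - IQ differences for twelve children (not study data).
hypothetical_diffs = [8, 12, -3, 5, 0, 9, 4, -1, 7, 10, 6, 2]
plus, minus, p_one, p_two = sign_test(hypothetical_diffs)
print(f"{plus} plus, {minus} minus, one-tailed P = {p_one:.3f}, two-tailed P = {p_two:.3f}")
```

In this sketch the two-tailed P is twice the one-tailed P whenever the doubled value stays below 1. The reported pairs (.19 and .38) show the same relation, which would hold for any symmetric test and so does not by itself identify the test used.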


Discussion 


The results of this study are interpreted as 
contradicting neither Doll’s nor Schmidt’s 
prior findings. With “relatively objective ob- 
servers” serving as Vineland informants, 
Doll’s contention that IQ and SQ are roughly 
equal is borne out. However, personally in- 
volved informants (as, after all, parents are) 
will introduce a constant bias to disturb the 
IQ-SQ balance, and SQs obtained from these 
informants will average higher than IQs. 

The practical limitations of clinical con-
tact with retarded children lead to results
which are more similar to Schmidt’s than
Doll’s, for the psychologist is often compelled
to rely on the information supplied by par-
ents. Retarded children are often nonverbal.
Also, they tend to be more negativistic and
resistive in the testing context. Thus the ex-
aminer is frequently faced with a child who
is quite unresponsive. In such circumstances,
he naturally turns to the parents for informa-
tion which might help him evaluate the abili-
ties and limitations of the child. The results
of this study are fair warning to the psy-
chologist not to accept such information at
face value, but to weigh it against the need
of parents to portray their children in fa-
vorable (or, one might add, unfavorable)
light.

The findings with respect to parents of re- 
tarded children, both with and without mo- 
tor impairment, suggest that the factor of am- 
biguity is significant for the production of 
autistic distortion on the Vineland. The par- 
ents whose children had a motor impairment 
rated them more like “relatively objective ob- 
servers” than the parents whose children were 
comparatively normal in motor performance. 
It was the presumption that there was less
perceptual ambiguity present in the parents
of the first group of children than the second, 
due principally to the presence of motor im- 
pairment in the first group. The “realistic” 
process was presumed to operate more strongly in
the parents of the first group than in the 
second. Considered in Freudian terms, in the 
parents of the first group the unconscious 
wish for normalcy lacked an external percept 
to which it could attach and reinforce itself. 

The findings should not, however, be inter- 
preted to mean that the parents of the first 
group did not have an unconscious wish for 
normalcy in their children. They merely sug- 
gest that there was less opportunity available 
for conscious expression of the wish. 

Would parents of children of average or 
better intelligence be free of autistic distortion 
as determined by the Vineland? Doll’s (1953) 
presentation of the results of Ceres’ study 
suggests that they would not, although it was 
not Ceres’ actual intent to demonstrate this. 
Comparing Vinelands, on which the parents 
informed about their children and on which 
the children informed about themselves, she 
found that parents portrayed the children in 
a more favorable light than the children por-
trayed themselves. Doll (1953) inferred that
“. . . the results from the mothers were con-
sistently higher throughout” (p. 479).

Accepting Murphy’s definition of percep- 
tion as coextensive with cognitive process, the 
writer contends that the parent-child rela- 
tionship is uniquely relevant to the study of 
autistic processes. Further investigation di- 
rected in this area should aim at uncovering 
the dynamic interplay, conscious and uncon- 
scious, between the need systems of parents 
and the perceptual evaluations of their chil- 
dren. 

Summary 

Evidence was presented in this study to
show that: (a) parents demonstrated a posi-
tive bias (labeled an autistic distortion) when
acting as Vineland informants about their re-
tarded children who were relatively normal
in motor functioning, but that (b) this trend
was not found in parents whose retarded chil-
dren were also handicapped in motor func-
tioning. Finding a was interpreted as the ef-
fect of autistic distortion; Finding b as an
operation of what has been termed the “re-
alistic” process.

It is felt, moreover, that the findings are 
consistent with the Freudian notion of uncon- 
scious wish fulfillment. In the case of the par- 
ents whose children were not handicapped, 
unconscious (i.e., nonverbalized) desires for 
normalcy could be reinforced by their percep- 
tion of relatively normal motor functioning, 
despite the lag in learning ability. Such de- 
sires were not as liable to external reinforce- 
ment in parents of handicapped children. 
THE STABILITY OF INTERPRETATIONS OF SENTENCE
COMPLETION TESTS¹


DONALD W. FISKE AND CHARLES VAN BUSKIRK


University of Chicago 


In previous studies of repeated administra- 
tions of sentence completion tests, Osterweil 
and Fiske (1956; Fiske, 1957) have found 
that the great majority of responses are 
changed from one testing to the next. In 
terms of manifest content, only a few of a 
person’s responses will be the same on two 
administrations. 

This finding raises a question: if the mani- 
fest content changes so markedly, does the 
personality picture inherent in the protocol 
also change from one time to the next, or does 
the same picture emerge from two protocols 
even though their manifest content is differ- 
ent? Osterweil and Fiske (1956) decided that 
two different protocols from the same person 
would probably lead to slightly different in- 
terpretive pictures. But how should the ex- 
tent and the significance of such differences 
be assessed? In view of the fact that projec- 
tive techniques are commonly used to obtain 
an individualized picture of a person, the 
present authors formulated this specific and 
practical question: are the differences between 
the interpretations of protocols from the same 
person less than the differences between inter- 
pretations of protocols of different persons? 
Thus inter-individual variation, variation be-
tween individuals, provides a frame of refer-
ence for evaluating the extent of intra-indi-
vidual variability, variation or inconsistency
within a person over time.


1 We are greatly indebted to Ralph Heine, Joanne
Powers, and Loren Chapman for serving as judges
in this experiment. We also wish to thank Sol Gar-
field for making it possible to collect the Downey
data. This study was supported in part by a grant
from the Social Science Research Committee of the
University of Chicago.


Method 


Our judges interpreted each protocol in 
terms of an ordering of needs with respect to 
salience or strength in the hierarchical con- 
figuration of an individual’s personality. More 
concretely, each judge did a Q sort of 30 
need variables. We used a conceptional frame-
work developed by Van Buskirk and Yufit
(1956) to obtain personality profiles, in con-
nection with a study of academic adjustment
being conducted by the Examiner’s Office at
the University of Chicago. In this approach,
a need is a valued goal with characteristic
feelings and actions associated with it: exam-
ples are Harmavoidance, Emotionality, Affilia-
tion, Sentience, and Disjunctivity. Each judge
was given a list of the variables and their defi-
nitions.²

In their Q sorts, the judges used a fixed 
distribution with 10 steps. Q sorts were used 
because the correlation between sorts permits 
a rapid and objective determination of the 
relative agreement or disagreement between 
two interpretations. Such a measure is appro- 
priate because we were interested in the rela- 
tive position of each need, not in its absolute 
strength. 
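
The agreement between two Q sorts described above can be sketched as a product-moment correlation over the 30 need placements. The forced distribution in the sketch is hypothetical, since the text gives only the number of steps (10) and of needs (30), and the two sorts are synthetic.

```python
import random
from math import sqrt
from statistics import mean

# Hypothetical forced distribution: needs allowed at each of the 10 steps (sums to 30).
PILE_SIZES = [1, 2, 3, 4, 5, 5, 4, 3, 2, 1]


def check_sort(sort):
    """Raise ValueError unless the sort follows the fixed distribution."""
    counts = [sort.count(step) for step in range(1, len(PILE_SIZES) + 1)]
    if counts != PILE_SIZES:
        raise ValueError("sort does not follow the fixed distribution")


def pearson(x, y):
    """Product-moment correlation between two equally long score lists."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den


# Synthetic example: sort_a[i] is the step (1-10) assigned to need i.
rng = random.Random(0)
sort_a = [step for step, n in enumerate(PILE_SIZES, start=1) for _ in range(n)]
rng.shuffle(sort_a)

# A second interpretation: same needs, placements reshuffled among the first ten.
sort_b = sort_a[:]
head = sort_b[:10]
rng.shuffle(head)
sort_b[:10] = head

check_sort(sort_a)
check_sort(sort_b)
r_ab = pearson(sort_a, sort_b)
print(f"correlation between the two sorts: {r_ab:.2f}")
```

Because every sort is forced into the same distribution, all sorts share one mean and one variance, so the correlation depends only on the squared differences in placement: r = 1 − Σd² / (2 Σ(x − x̄)²). This is why the measure reflects the relative position of each need and not its absolute strength.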

Subjects. We began by using available pro-
tocols. From a group of University of Chicago
students tested nine times at weekly intervals
(Fiske, 1957), we picked four Ss and took
their first, fifth, and ninth protocols—these
being obtained at one month intervals. The
four men formed a sample stratified on intra-
individual variability: out of 30 men, we se-
lected the 3rd, 11th, 19th, and 26th in terms
of consistency of manifest content over the
full nine trials.

2 Copies of the variables and their definitions can
be obtained from the authors.

At a later time, we decided that our first 
group of Ss might be too homogeneous and 
that our Sentence Completion Test might be 
too short. We then tested four patients at 
Downey VA Hospital, two men and two 
women. These were essentially a random sam- 
ple except that one was taken from each of 
four buildings and each had to be willing to 


Donald W. Fiske and Charles Van Buskirk 


take the test and sufficiently in contact to do 
so. Their ages were 32 to 56. As it happened, 
all four were diagnosed as paranoid schizo- 
phrenics. However, the four case histories 
were quite different. The intervals between 
the successive pairs of the three tests ranged 
from 20 to 38 days. 

Instruments. The Sentence Completion Test 
administered to the Chicago students was a 
25-item form derived from one compiled and
developed by Osterweil and Fiske (1956).
The Sacks Sentence Completion Test (Sacks 
& Levy, 1950) was used with the Downey 
patients. It has 60 items. 

Judges. Arrangements were made to permit 
each author to judge each protocol blindly. 
In addition, one experienced clinician judged 
the Chicago protocols, and the Downey pro- 
tocols were judged by one experienced cli- 
nician and by a research psychologist with 
less clinical experience. 

Procedure. The protocols were presented to 
the judges in a random order with respect to 
subject and to the orders of each S’s three 
protocols (first, second, or third testings), 
with the restriction that no pair of protocols 
from the same S were next to each other. Half 
the judges had the same order and the other 
half had this order reversed. The judges did 
their judging in several sessions. 
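
The ordering restriction described above can be illustrated with a short sketch. The source does not say how the random order was actually produced, so the rejection-sampling approach, the function name `presentation_order`, and the subject labels are assumptions made only for illustration.

```python
import random

def presentation_order(subject_ids, rng):
    """Return a random ordering of protocol indices in which no two
    adjacent protocols come from the same subject (rejection sampling)."""
    order = list(range(len(subject_ids)))
    while True:
        rng.shuffle(order)
        neighbours = zip(order, order[1:])
        if all(subject_ids[a] != subject_ids[b] for a, b in neighbours):
            return order

# Hypothetical labels: four subjects, three protocols each.
subject_ids = [s for s in "ABCD" for _ in range(3)]
forward = presentation_order(subject_ids, random.Random(0))
backward = forward[::-1]  # order for the other half of the judges
```

Reversing a valid order keeps the same adjacent pairs, so the reversed order satisfies the restriction as well.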

The 12 sorts made by a judge on one of 
the S samples were intercorrelated. The inter- 
correlations for each judge were analyzed 
separately—the several judges providing rep- 
lications of the basic design. This approach 
made it possible to eliminate interjudge dif- 
ferences from the comparisons of interpreta- 
tions. Furthermore, the primary effect of any 
unreliability in the judge (variation in judg- 
ments over time) would be only to depress 
all the correlations between his interpretive
sorts.


Results


The unit of analysis was the interpretation 
of a single protocol by a single judge. For a 
given judge, we compared this interpretation 
with the other two interpretations from the 
same S and with the nine interpretations from 
the other Ss. The Q sort form of the inter- 
pretations permitted us to compute a corre- 
lation between any two of them. We used the 
median correlation of each class of protocols; 
that is, we compared the median of the two 
within-person correlations with the median of 
the between-person correlations. The results 
are presented in Table 1. 
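
The comparison of medians described above can be sketched as follows for one judge's 12 sorts. This is a minimal illustration, not the authors' computation: the function name `compare_medians`, the number of Q-sort items (40), and the synthetic data are all assumptions, and real Q sorts use a forced distribution rather than continuous values.

```python
import numpy as np

def compare_medians(sorts, subject_ids):
    """For each protocol, return (within-person median r,
    between-person median r, reversal flag) for one judge."""
    sorts = np.asarray(sorts, dtype=float)
    ids = np.asarray(subject_ids)
    r = np.corrcoef(sorts)  # each row of `sorts` is one interpretation
    rows = []
    for i in range(len(ids)):
        same = ids == ids[i]
        same[i] = False           # exclude the protocol itself
        other = ids != ids[i]
        within = float(np.median(r[i, same]))
        between = float(np.median(r[i, other]))
        rows.append((within, between, bool(between > within)))
    return rows

# Synthetic data (assumption): 4 subjects x 3 testings, 40 items,
# each protocol is a stable subject profile plus small noise.
rng = np.random.default_rng(0)
profiles = rng.normal(size=(4, 40))
subject_ids = np.repeat(np.arange(4), 3)
sorts = profiles[subject_ids] + 0.1 * rng.normal(size=(12, 40))
rows = compare_medians(sorts, subject_ids)
reversals = sum(rev for _, _, rev in rows)
```

A protocol counts as a reversal when its between-person median exceeds its within-person median, which is the criterion used in the text; no significance test is involved.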

Among 84 comparisons of this sort, there 
were 21 protocols (25%) where the median 
inter-person correlation exceeded the median 
intra-person correlation. In other words, in 
one-quarter of the instances, the interpretation
of a protocol tended to agree better with
interpretations for other cases than with those 
for the same case. Other analyses of the total 
distributions of the correlations gave a similar 
picture to this result found with the medians: 
about 25% misclassification. 

Note that the criterion was simply whether 
the within-person median is above (or below)
the between-person median, not whether it is 
significantly different. 

When we examined the stability of the Ss 
separately, we found there was only one in 
each set of four Ss (A and H respectively) 
who was consistently differentiated by all 
three or all four judges. For the remaining Ss, 
reversals in which the between-person median 
exceeded the within-person median occurred 
in 11% to 44% of the judge-protocol com- 
binations. 

In terms of judges, there were reversals in 
six out of seven instances: only one judge 
successfully differentiated all 12 protocols in 
one set (Judge I for patient Ss). 


Discussion 


There was a possibility that each set of 
cases was perhaps so homogeneous that dif- 
ferentiation between them was very difficult. 
As a check on this, we examined the overlap 
between protocols in terms of their manifest 
content. We counted the number of items on 
which each pair of protocols had the same or 
almost the same response, using a strict criterion
for sameness (cf. Fiske, 1957).

For each protocol, we found the protocol 
with which it overlapped most. For 22 of the 
24 protocols, the highest overlap was with 
another protocol from the same person. For 
the other two, there was little overlap and 
there were tied values, but on a chance basis, 
they would probably be misclassified. Thus 
the manifest content differentiated the indi- 
vidual cases better than the judges’ inter- 
pretations. The same result was obtained in 
a comparison of the median intra-person over- 
lap with the median inter-person overlap in 
manifest content. 
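
The overlap check can be sketched in a few lines. The strict criterion for sameness is the one in Fiske (1957) and is not reproduced in this report, so the exact-match rule below (after trimming and lower-casing) is only a stand-in assumption, as are the function names and the toy responses.

```python
def overlap(a, b):
    """Number of items on which two protocols give the same response.
    Sameness is approximated here by a case-insensitive exact match."""
    return sum(1 for x, y in zip(a, b)
               if x.strip().lower() == y.strip().lower())

def best_match_same_subject(protocols, subject_ids):
    """For each protocol, report whether every protocol tied for the
    highest overlap with it comes from the same subject."""
    flags = []
    for i, p in enumerate(protocols):
        scores = {j: overlap(p, q)
                  for j, q in enumerate(protocols) if j != i}
        top = max(scores.values())
        best = [j for j, s in scores.items() if s == top]
        flags.append(all(subject_ids[j] == subject_ids[i] for j in best))
    return flags

# Toy protocols (hypothetical): two subjects, two testings, three items.
protocols = [["happy", "work", "mother"], ["happy", "work", "father"],
             ["sad", "play", "sister"], ["sad", "play", "brother"]]
subject_ids = ["A", "A", "B", "B"]
flags = best_match_same_subject(protocols, subject_ids)
```

Ties are treated conservatively: a protocol is counted as correctly matched only when all of its top-overlap partners come from the same subject.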

We may therefore conclude that the judges’ 
results are not due to the homogeneity of 
manifest content among the Ss compared: 
the different Ss gave different and distinctive 
responses.

As another check, we compared the median
correlations between cases in each group with 
the median of a sample of correlations be- 
tween student cases and patient cases. Again, 
there was no evidence that the homogeneity 
of each group accounts for our results. 

The limited numbers of Ss and judges re- 
strict the confidence in any broad generaliza- 
tion from these findings. However, we feel 
that the data clearly indicate that one protocol
differs in manifest content so much from
other protocols of the same S that its interpretation
will frequently resemble the interpretations
of his other protocols less than it
resembles interpretations for other Ss. This 
finding contradicts the assumption that the 
uniqueness of a single personality can be 
delineated in an interpretation of a sample of 
behavior obtained at one point in time. 

The purpose of this paper is to point out 
the existence of this problem. The complexity 
of the problem is indicated by the consider- 
able variation between Ss and also between 
judges in the extent of differentiation. The 
extent of overlap may also be different for 
other interpretive systems and for other techniques.
It may be less (or more!) for the extensive
sample of behavior provided by a full
diagnostic battery. On the basis of our findings,
however, it would seem necessary for
clinicians performing individual diagnostic
evaluations to establish that their formulations
not only differentiate between cases but
also are stable over time.


Summary 


Sets of sentence completion protocols were 
judged in terms of a Q sort of need variables. 
Since each set contained three protocols from 
each of four subjects, it was possible to com- 
pare the agreement between interpretations of 
each subject’s three protocols with the agree- 
ment between interpretations for different 
subjects. In 25% of the comparisons, the 
agreement was higher with protocols for dif- 
ferent subjects than with the other protocols 
from the same subject. Thus a single proto- 
col may be an insufficient basis for an inter- 
pretation that differentiates one person from 
other people. 

BRIEF REPORTS


SCORE RELIABILITY OF DICHOTOMOUS AND
TRICHOTOMOUS ITEM RESPONSES ON THE
MAUDSLEY PERSONALITY INVENTORY 1


A. W. BENDIG


University of Pittsburgh


1 An extended report of this study may be obtained
without charge from A. W. Bendig, Department of
Psychology, University of Pittsburgh, Pittsburgh 13,
Pa., or for a fee from the American Documentation
Institute. Order Document No. 5780, remitting $1.25
for microfilm or $1.25 for photocopies.

Two of the most common procedures in having
Ss respond to the items on a personality scale
are the dichotomous-response method, where the
S responds “Yes-No” or “True-False,” and the
responses are weighted 1 and 0, and
the trichotomous-response method, where the S
is allowed an alternative “Uncertain” or “?”
response in addition to the two options permitted
in the dichotomous-response method and the
“Yes-?-No” responses are weighted 2, 1, and 0.
An example of the first item response method is
Taylor’s Manifest Anxiety Scale, while the second
method is used for the Extraversion and Neuroticism
scales included in Eysenck’s Maudsley Personality
Inventory (1956). Whichever method is
used appears to depend upon the preference of
the constructor of the test with no data being
available on the comparative reliability of each
method. One disadvantage of the trichotomous-response
method with machine-scored answer sheets used
in research studies is that the scoring
labor is doubled since the sheets have to be
scored separately for the responses weighted 2
and those weighted 1. However, test scores using
the trichotomous-response method have a larger
variance and might be assumed to be more reliable
as a result.

The standard trichotomous-response form of
the MPI was administered to 106 pre-education
students (48 men and 58 women) and a new
dichotomous-response form (omitting the “?”
response option) was given to 104 similar Ss (35
men and 69 women). The Ss were enrolled in
nine sections of introductory educational psychology
with approximately half the Ss in each section
getting each form. The reliability of the E
and N scores was estimated by Kuder-Richardson
Formula 20.
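
For readers unfamiliar with the coefficient, Kuder-Richardson Formula 20 can be sketched as below for items scored 0 or 1. The function name `kr20` and the toy data are assumptions for illustration. The report does not describe how the formula was applied to the 2-1-0 weighted form; the usual generalization replaces each item's p*q with its variance (coefficient alpha), and that is not shown here.

```python
import numpy as np

def kr20(item_scores):
    """Kuder-Richardson Formula 20 for a subjects-by-items array of
    0/1 scores: (k / (k - 1)) * (1 - sum(p * q) / var(total))."""
    x = np.asarray(item_scores, dtype=float)
    k = x.shape[1]                    # number of items
    p = x.mean(axis=0)                # proportion scoring 1 on each item
    q = 1.0 - p
    total_var = x.sum(axis=1).var()   # population variance, matching p*q
    return (k / (k - 1)) * (1.0 - (p * q).sum() / total_var)

# Toy data (hypothetical): four subjects, three items.
toy = [[1, 1, 0],
       [1, 0, 0],
       [0, 0, 0],
       [1, 1, 1]]
reliability = kr20(toy)
```

The population variance (NumPy's default) is used for the total scores so that it is on the same footing as the item variances p*q.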

For the dichotomous-response form, the reliability
of the E scale was .77 (men) and .79 (women), while
the reliabilities for the N scale were .88 (men) and
.80 (women). Comparable reliabilities from the
trichotomous-response form were .74 and .78 for the
E scale and .90 and .84 for the N scale. The
reliabilities reported from the previous use of the
trichotomous-response form (Bendig, 19 were .73 and
.75 for the E scale and .86 and .79 for the N scale.
The intercorrelations between the E and N scales for
the sex groups getting the dichotomous-response form
were -.18 and -.11, while the correlations for


OF A SOCIALIZED SCALE 1, 2


DONALD R. PETERSON
Vanderbilt University
Audy Branch, Montefiore Schools, Chicago, Illinois


The construct validity of the Socialization (So) 
scale of the California Psychological Inventory 
was extended by confirming previously demonstrated
findings on the discriminatory effectiveness
of the scale, and by testing relationships between
So scores and a number of indices pertinent
either to the construct: socialization itself,
or to some competing interpretative hypothesis. 
The scale was administered to 239 inmates of a 
training school for boys and to a total of 428 
nondelinquent Ss from three different schools. 
From examination of a variety of relationships, 
the following results emerged. 

1. Matching delinquents and nondelinquents in
respect to race, age, and place of residence did 
not reduce the screening efficiency of the scale. 
As in earlier studies, the scale correctly identi- 
fied a combined average of over 70% of the 
members of both groups. 

2. “Good citizens” and “disciplinary problems” 
in high school differed significantly from each 
other and from unselected high school students. 

3. Delinquent recidivists differed significantly 
from first offenders. 

4. A low but statistically reliable correlation
occurred between So scores and number of court
contacts prior to commitment to training school.

5. Delinquent Negroes did not differ reliably
from delinquent whites.

6. Academically retarded delinquents did not
differ significantly from those characterized by
appropriate age-grade placement.

7. Negroes from Chicago did not differ significantly
from white Ss living elsewhere in the state
of Illinois.

8. So scores did not correlate significantly with
areal delinquency rate within the city of Chicago.

9. The correlation between So scores and IQ
was negligible for delinquents but significantly
positive for high school students.

10. The correlation between So scores and age
was negligible for delinquents but significantly
negative for nondelinquents.

In every case, significant relationships occurred
as one would expect to find them if the So scale
measures socialization. With the two exceptions
stated immediately above, relationships pertinent
to various competing interpretations were not
significant. In conjunction with previous findings
on the validity of the So scale, these data rather
strongly encourage further study of the measure
and the trait it evidently represents. Studies
concerning the factorial unity of the scale, its
predictive utility, and its relationship to various
antecedents seem most clearly indicated.

1 An extended report of this study may be obtained
without charge from Donald R. Peterson,
Psychology Department, University of Illinois,
Urbana, Illinois, or for a fee from the American
Documentation Institute. Order Document No. 5775,
remitting $1.25 for microfilm or $1.25 for photocopies.

2 Formerly at the University of Illinois. Parts of
this research were supported by a grant from the
Institute for Research and Training in the
Sciences, Vanderbilt University.


THE RELATIONSHIP BETWEEN PERSONAL AND SOCIAL
DESIRABILITY SCALE VALUES OF THE EDWARDS
PERSONAL PREFERENCE SCHEDULE


LEONARD D. GOODSTEIN AND ALFRED B. HEILBRUN, JR.


State University of Iowa


The rationale of using social desirability rat- 
ings in personality test research is that all state- 
ments used to describe behavior may be ordered 
along a continuum of social desirability and that 
the position of each self-description along this 
continuum is one of the factors determining the 
probability of endorsement by a test taker. It 
may be argued, however, that the obtained social 
desirability values are estimates of the desirabil- 
ity of behavior in others and that most test re- 
spondents are concerned with the desirability of 
such behavior in themselves. The relationship be- 
tween obtained social desirability scale values and 
values along a continuum of personal desirability 
has been relatively ignored in current research, 
and the present paper is an effort to explore this 
relationship. 

Each of 100 undergraduate Ss was required to 
judge the statements of the Personal Preference 
Schedule (PPS) along a nine-point continuum of 
personal desirability. The instructions conformed 
to those used in standardizing the PPS except 
that the concept of personal desirability was sub- 
stituted for social desirability. The obtained per- 
sonal desirability ratings were transformed into 
scale values by the same method of successive 
intervals used with the PPS. 

The obtained correlation between personal and
social desirability ratings was .90, indicating that
the scale values obtained under personal desirability
instructions are very highly correlated
with those obtained under social desirability
instructions.

There was no significant difference between the
means of the personal and social desirability
ratings (t = 0.47 for 133 df). The statements of
the PPS, however, provide an evaluation of 15
different needs, each of which is measured by
nine statements, and an examination of the scale
values by need category suggested that the over-all
nonsignificance might be masking systematic
differences within the need categories. To evaluate
this possibility, the 15 needs were divided
into two groups of seven needs each, high and
low personally desirable, on the basis of an
independently obtained personal desirability
ranking of the needs.

An analysis of the mean differences between
personal and social desirability ratings indicated
no difference for those statements in the high
personally desirable need group, but those
statements in the low personal desirability group
received higher social than personal desirability
ratings. The difference between these differences
was statistically significant (t = 2.15 for 124 df,
p < .04), indicating an interaction between type
of desirability rating and the personal desirability
of the needs involved in such ratings. Thus
despite the high correlation between personal and
social desirability ratings and the nonsignificance
of the difference between the over-all mean
ratings, there was a tendency for personal
desirability ratings to be differentially lower than
social desirability ratings when the statements
rated were associated with personally undesirable
needs.

1 An extended report of this study may be obtained
without charge from Leonard D. Goodstein,
Department of Psychology, State University of Iowa,
Iowa City, Iowa, or for a fee from the American
Documentation Institute. Order Document No. 5778,
remitting $1.25 for microfilm or $1.25 for photocopies.